(57) Abstract



A gene (cDNA) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as SEQ ID NO:1 and the protein
sequence is identified as SEQ ID NO:2. A mutant gene (SEQ ID NO:3) in which the coding sequence lacks an 11-base pair consecutive
sequence (SEQ ID NO:11) of the sequence encoding bovine protein having myostatin activity has been sequenced. It has been shown that
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are double-muscled. A method for determining
the presence of muscular hyperplasia in a mammal is described. The method includes obtaining a sample of material containing DNA from 
the mammal and ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity of myostatin, is present, and 
whether a sequence of the DNA encoding (b) an allelic protein lacking the activity of (a), is present. The absence of (a) and the presence 
of (b) indicates the presence of muscular hyperplasia in the mammal. 
fraction as measured by hydroxyproline content (Hanset et al., 1982). Double-muscled animals
were shown to have a reduced feed intake with improved feed conversion ratio (Hanset et al.,
1987). An important economic benefit of double-muscled animals, in contrast to conventional
animals, is the substantial increase in selling price and net income for the farmer (Hanset et al.,
1987).

One of the most thorough series of studies on double-muscling is that of Hanset
and colleagues in the Belgian Blue Breed. Objective criteria of muscular development, such as
dressing-out percentage, lean and fat percentage, plasma and red cell creatine and creatinine
concentrations, were measured on nearly 150 randomly selected animals raised in standardized
conditions. These studies clearly revealed abnormal, bimodal distributions of the double-muscled
phenotype and objectively confirmed the visual classification traditionally performed by
breeders on double-muscled and conventional animals. The phenotypic distribution was
resolved using a maximum likelihood procedure into two component normal populations with a
common variance which revealed mean differences of three to four standard deviations
depending on the trait. This suggested the presence of an allele having a major effect on
muscular development with a population frequency close to 50% (Hanset and Michaux, 1985b).
The most convincing evidence in favour of such an allele, however, came from experimental
crosses involving double-muscled Belgian Blue sires and Holstein Friesian dairy cows (the latter
animals having very poor muscular development). While F1 offspring showed a phenotypic
distribution very similar to their Holstein Friesian dams, backcrossing these F1s to double-muscled
sires produced a bimodal BC generation, clearly pointing towards the Mendelian
segregation of a recessive "mh" (muscular hypertrophy) allele (Hanset and Michaux, 1985a).
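
The maximum likelihood resolution of a bimodal trait into two normal components with a common variance, described above, can be illustrated with a short expectation-maximisation sketch. This is not the procedure used by Hanset and Michaux, whose details are not given here; the function name, the starting values and the simulated trait values are all assumptions made for illustration.

```python
import math
import random

def normal_pdf(x, mean, sd):
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def fit_two_normals_common_variance(values, iterations=200):
    """EM estimate of (weight of component 1, mean1, mean2, common sd)."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    # Start from the means of the lower and upper halves of the sorted data.
    mean1 = sum(data[:half]) / half
    mean2 = sum(data[half:]) / (n - half)
    overall = sum(data) / n
    sd = math.sqrt(sum((x - overall) ** 2 for x in data) / n)
    weight1 = 0.5
    for _ in range(iterations):
        # E-step: posterior probability that each value belongs to component 1.
        resp = []
        for x in data:
            p1 = weight1 * normal_pdf(x, mean1, sd)
            p2 = (1.0 - weight1) * normal_pdf(x, mean2, sd)
            resp.append(p1 / (p1 + p2))
        # M-step: update the weight, both means and the single shared variance.
        total1 = sum(resp)
        total2 = n - total1
        weight1 = total1 / n
        mean1 = sum(r * x for r, x in zip(resp, data)) / total1
        mean2 = sum((1.0 - r) * x for r, x in zip(resp, data)) / total2
        var = sum(r * (x - mean1) ** 2 + (1.0 - r) * (x - mean2) ** 2
                  for r, x in zip(resp, data)) / n
        sd = math.sqrt(var)
    return weight1, mean1, mean2, sd

# Simulated trait values (assumed, not study data): two groups of 300 animals
# whose means lie 3.5 standard deviations apart.
rng = random.Random(0)
sample = ([rng.gauss(0.0, 1.0) for _ in range(300)]
          + [rng.gauss(3.5, 1.0) for _ in range(300)])
weight1, mean1, mean2, sd = fit_two_normals_common_variance(sample)
```

With components separated by three to four standard deviations, as reported in the text, the two estimated means stay well apart and the mixing weight approximates the proportion of animals in each phenotypic class.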

The same kind of experimental crosses were subsequently used to perform a
whole genome scan using a microsatellite-based marker map. To perform the linkage analysis,
animals were classified as double-muscled or conventional. Very significant Logarithm of the
Odds scores (lodscores) were obtained on chromosome 2 (> 17), and multipoint linkage
analysis positioned the mh locus at the centromeric end of this chromosome, at 2 centimorgans
from the nearest microsatellite marker, TGLA44. The corresponding chromosomal region
accounted for all the variance of the trait, assumed to be fully penetrant in this experiment
(Charlier et al., 1995).
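
As a reminder of what a lodscore measures, the following sketch computes the two-point lodscore for the simplest case of phase-known, fully informative meioses: the base-10 logarithm of the likelihood at a recombination fraction theta divided by the likelihood under free recombination (theta = 0.5). The multipoint analysis cited above is more involved. The function names and the counts used here are hypothetical and are not data from the study.

```python
import math

def lod_score(recombinants, meioses, theta):
    """LOD for phase-known meioses: log10 of L(theta) / L(0.5)."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    non_recombinants = meioses - recombinants
    likelihood = (theta ** recombinants) * ((1.0 - theta) ** non_recombinants)
    if likelihood == 0.0:
        return float("-inf")
    return math.log10(likelihood) - meioses * math.log10(0.5)

def best_theta(recombinants, meioses, steps=500):
    """Grid search for the recombination fraction that maximises the LOD."""
    grid = [0.5 * i / steps for i in range(steps + 1)]
    return max(grid, key=lambda t: lod_score(recombinants, meioses, t))

# Hypothetical counts, not data from the study: 1 recombinant in 50 meioses.
theta_hat = best_theta(1, 50)
peak_lod = lod_score(1, 50, theta_hat)
```

For short map distances a recombination fraction of 0.02 corresponds to roughly 2 centimorgans, and a lodscore above 3 is conventionally taken as significant evidence of linkage.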

In humans, genes coding for some forms of muscular abnormalities have been
isolated, e.g. muscular dystrophy. The present invention provides for the gene which regulates
the development of skeletal muscle only, as opposed to other types of muscle, e.g. smooth or
cardiac muscle. The present invention may provide an understanding of the role of the GDF-8
gene or its receptor in the regrowth of skeletal muscle in humans which only undergo a
hyperplasic response.
MUTATIONS IN THE MYOSTATIN GENE CAUSE DOUBLE-MUSCLING IN MAMMALS
Field of The Invention 

This invention relates to factors affecting muscle development in mammals,
especially livestock. In particular, this invention relates to the cloning of the myostatin gene, a
member of the TGF-β superfamily, its involvement in muscular hyperplasia in livestock, and a
method for determining myostatin genotypes.

Description Of Related Art 

The TGF-β superfamily consists of a group of multifunctional polypeptides which
control a wide range of differentiation processes in many mammalian cell types. GDF-8 is a
member of the TGF-β superfamily. All members of this superfamily share a common structure
including a short peptide signal for secretion and an N-terminal peptide fragment that is
separated from the bioactive carboxy-terminal fragment by proteolytic cleavage at a highly
conserved proteolytic cleavage site. The bioactive carboxy-terminal domain is characterized by
cysteine residues at highly conserved positions which are involved in intra- and intermolecular
disulfide bridges. The functional molecules are covalently linked (via an S-S bond) dimers of the
carboxy-terminal domain (Masterson et al., 1996).

Recently, it was reported that mice deficient in the gene coding for GDF-8 were
characterized by a generalized muscular hyperplasia (McPherron et al., 1997). The GDF-8
deficient mice were produced by gene targeting using homologous recombination in embryonic
stem cells, a method referred to as "gene knock-out". The murine generalized muscular
hyperplasia appeared to be very similar in its expression to the muscular hyperplasia
characterizing "double-muscled" cattle. This observation raised the intriguing possibility that the
bovine gene coding for GDF-8 (i.e. the bovine evolutionary homologue of the mouse GDF-8
gene) is involved in the bovine double-muscling phenotype. It also raised the possibility that the
human gene coding for GDF-8 (i.e. the human evolutionary homologue of the mouse GDF-8
gene) is involved in regulating muscular development in humans, specifically skeletal muscle
genesis. Isolation of the human GDF-8 gene may have therapeutic applications in the
treatment of musculodegenerative diseases through increasing or decreasing the expression of
GDF-8.

The occurrence of animals characterized by a distinct generalized muscular
hypertrophy, commonly known as "double-muscled" animals, has been reported in several cattle
breeds around the world. The first documented description of double-muscled cattle dates back
as early as 1807 (Culley, 1807). One of the breeds in which this characteristic has been most
thoroughly analyzed is the Belgian Blue Cattle Breed ("Belgian Blue Breed"). This is one of the
only breeds where the double-muscled trait has been systematically selected for, and where the
double-muscled phenotype is virtually fixed. A comparison of double-muscled and conventional
animals within the Belgian Blue Breed showed an increase in muscle mass by 20% on average,
while all other organs were reduced in size (Hanset, 1986 and 1991). The muscular hypertrophy
was shown to be a histological hyperplasia affecting primarily superficial muscles,
accompanied by a 50% reduction in total lipid content and a reduction in connective tissue



WO 99/02667    PCT/IB98/01197



Summary of the Invention 

The present inventors have identified and sequenced a gene (cDNA and
genomic) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as
SEQ ID NO:1 and the protein sequence is identified as SEQ ID NO:2. The genomic bovine
sequence is identified as SEQ ID NO:54. A mutant gene (SEQ ID NO:3) in which the coding
sequence lacks an 11-base pair consecutive sequence (SEQ ID NO:11) of the sequence
encoding bovine protein having myostatin activity has been sequenced. It has been shown that
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are
double-muscled. Other bovine mutations which lead to double-muscling have also been
determined, being identified herein as nt419(del7-ins10), Q204X, E226X and C313Y,
respectively.

In one aspect, the present invention thus provides a method for determining the
presence of muscular hyperplasia in a mammal. The method includes obtaining a sample of
material containing DNA from the mammal and ascertaining whether a sequence of the DNA
encoding (a) a protein having biological activity of myostatin is present, and whether a sequence
of the DNA encoding (b) an allelic protein lacking the activity of (a) is present. The absence of
(a) and the presence of (b) indicates the presence of muscular hyperplasia in the mammal.
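
The decision rule of this aspect can be written out as a small lookup. The sketch below is illustrative only: the function name and the wording of the returned labels are assumptions, and the handling of the cases the text does not address (both sequences detected, or neither) is a reading based on the recessive inheritance described in the background section.

```python
def interpret_myostatin_alleles(functional_present, inactive_present):
    """Map detection results for sequences (a) and (b) to an interpretation.

    functional_present: a sequence encoding (a), active myostatin, was detected.
    inactive_present:   a sequence encoding (b), an inactive allelic protein,
                        was detected.
    """
    if functional_present and inactive_present:
        # Not covered by the stated rule; assumed here to be a carrier.
        return "heterozygous: both (a) and (b) detected"
    if functional_present:
        return "only (a) detected: no inactive allele found"
    if inactive_present:
        # The case stated in the text: absence of (a) and presence of (b).
        return "only (b) detected: indicates muscular hyperplasia"
    return "neither detected: assay inconclusive"

result = interpret_myostatin_alleles(False, True)
```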

Of course, the mutation responsible for the lack of activity can be a naturally
occurring mutation, as is the case for the Belgian Blue, Asturiana, Parthenaise or Rubia Gallega
breeds, shown here.

The mammal can be a human, bovine, etc. 

There are several methods known for determining whether a particular
nucleotide sequence is present in a sample. A common method is the polymerase chain
reaction. A preferred aspect of the invention thus includes a step in which ascertaining whether a
sequence of the DNA encoding (a) is present, and whether a sequence of the DNA encoding (b)
is present includes amplifying the DNA in the presence of primers based on a nucleotide
sequence encoding a protein having biological activity of myostatin.

A primer of the present invention, used in PCR for example, is a nucleic acid
molecule sufficiently complementary to the sequence on which it is based and of sufficient length
to selectively hybridize to the corresponding portion of a nucleic acid molecule intended to be
amplified and to prime synthesis thereof under in vitro conditions commonly used in PCR.
Likewise, a probe of the present invention is a molecule, for example a nucleic acid molecule of
sufficient length and sufficiently complementary to the nucleic acid molecule of interest, which
selectively binds under high or low stringency conditions with the nucleic acid sequence of
interest for detection thereof in the presence of nucleic acid molecules having differing
sequences.

In preferred aspects, primers are based on the sequence identified as SEQ ID 
NO:7 (human cDNA sequence) or SEQ ID NO:54. 
In another aspect, the invention is a method for determining the presence of
muscular hyperplasia in a mammal which includes obtaining a sample of material containing
mRNA from the mammal. Such method includes ascertaining whether a sequence of the
mRNA encoding (A) a protein having biological activity of myostatin is present, and whether a
sequence of the mRNA encoding (B) a protein at least partially encoded by a truncated
nucleotide sequence corresponding to substantially the sequence of the mRNA and lacking the
activity of (A) is present. The absence of (A) and the presence of (B) indicates the presence of
muscular hyperplasia in the mammal.

The mRNA encoding (A) and the truncated sequence can correspond to alleles
of DNA of the mammal.

Again, if an amplification method such as PCR is used in ascertaining whether a
sequence of the mRNA encoding (A) is present, and whether a sequence of the mRNA encoding
(B) is present, the method includes amplifying the mRNA in the presence of a pair of primers
complementary to a nucleotide sequence encoding a protein having biological activity of
myostatin. Each such primer can contain a nucleotide sequence substantially complementary,
for example, to the sequence identified as SEQ ID NO:7. The truncated sequence can contain at
least 50 consecutive nucleotides substantially corresponding to 50 consecutive nucleotides of
SEQ ID NO:7, for example.

In another aspect, the invention is a method for determining the presence of
muscular hyperplasia in a mammal which includes obtaining a tissue sample containing
mRNA of the mammal and ascertaining whether an mRNA encoding a mutant type myostatin
protein lacking biological activity of myostatin is present. The presence of such an mRNA
encoding a mutant type myostatin protein indicates the presence of muscular hyperplasia in the
mammal.

In another aspect, the invention thus provides a method for determining the
presence of muscular hyperplasia in a bovine animal. The method includes obtaining a sample
of material containing DNA from the animal and ascertaining whether DNA having a nucleotide
sequence encoding a protein having biological activity of myostatin is present. The absence of
DNA having such a nucleotide sequence indicates the presence of muscular hyperplasia in the
animal. Ascertaining whether DNA having a nucleotide sequence encoding a protein having
biological activity of myostatin is present can include amplifying the DNA in the presence of primers based
on a nucleotide sequence encoding a protein having biological activity of myostatin.

In particular, the method can be carried out using a sample from an animal in
which such a bovine animal not displaying muscular hyperplasia is known to have a nucleotide
sequence which is capable of hybridizing with a nucleic acid molecule having the sequence
identified as SEQ ID NO:1 under stringent hybridization conditions.

It is possible that ascertaining whether DNA having a nucleotide sequence
encoding a protein having biological activity of myostatin is present includes amplifying the DNA
in the presence of primers based on a nucleotide sequence encoding the N-terminus and the
C-terminus, respectively, of the protein having biological activity of myostatin.

Primers, say first and second primers, can be based on first and second
nucleotide sequences encoding spaced apart regions of the protein, wherein the regions flank a
mutation known to naturally occur and which when present in both alleles of such an animal
results in muscular hyperplasia.

It can also be that DNA of such an animal not displaying muscular hyperplasia
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding
sequence of DNA of such an animal displaying muscular hyperplasia is known to contain an
11-base pair deletion beginning at base pair no. 821 of the coding sequence, and said first primer
is selected to be upstream of the codon encoding glutamic acid no. 275 and the second primer
is selected to be downstream of the codon encoding aspartic acid no. 274.

Also, a DNA of such an animal not displaying muscular hyperplasia might
contain a nucleotide sequence which hybridizes under stringent conditions with a nucleotide
sequence encoding a protein having a sequence identified as SEQ ID NO:2. The coding
sequence of DNA of such an animal displaying muscular hyperplasia might be known to contain
an 11-base pair deletion beginning at base pair no. 821. A primer can be selected to span the
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing the
deletion.
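
The two primer strategies just described, a pair flanking the 11-base pair deletion and a single primer spanning the new junction at base pairs 820 and 821 of the deleted sequence, can be sketched on a synthetic template. Everything below apart from the position (821) and length (11) of the deletion is an assumption: the template is a random placeholder rather than SEQ ID NO:1, and the primer coordinates and helper names are hypothetical.

```python
import random

DELETION_START = 821   # 1-based position in the coding sequence (from the text)
DELETION_LENGTH = 11

def delete_segment(coding_sequence, start=DELETION_START, length=DELETION_LENGTH):
    """Return the sequence with `length` bases removed from 1-based `start`."""
    index = start - 1
    return coding_sequence[:index] + coding_sequence[index + length:]

def junction_primer(mutant_sequence, start=DELETION_START, flank=10):
    """Bases spanning positions start-1 and start of the deleted sequence."""
    index = start - 1
    return mutant_sequence[index - flank:index + flank]

def amplicon_length(template, forward_site, reverse_site):
    """Length of the product delimited by two flanking primer sites."""
    begin = template.index(forward_site)
    end = template.index(reverse_site, begin) + len(reverse_site)
    return end - begin

# Synthetic placeholder of arbitrary length, NOT the bovine myostatin sequence.
rng = random.Random(1)
wild_type = "".join(rng.choice("ACGT") for _ in range(1200))
mutant = delete_segment(wild_type)

# Hypothetical flanking primer sites taken upstream and downstream of the deletion.
forward_site = wild_type[780:800]
reverse_site = wild_type[880:900]
wild_type_product = amplicon_length(wild_type, forward_site, reverse_site)
mutant_product = amplicon_length(mutant, forward_site, reverse_site)

# A junction-spanning primer exists only in the deleted allele.
primer = junction_primer(mutant)
```

With flanking primers the two alleles are told apart by product size, the deleted allele giving a product 11 base pairs shorter; with a junction-spanning primer they are told apart by whether amplification occurs at all.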

The animal can be of the Belgian Blue breed. 

In a particular aspect, ascertaining whether DNA having a nucleotide sequence
encoding a protein having biological activity of myostatin is present includes amplifying the DNA
in the presence of a primer containing at least a portion of a mutation known to naturally occur
and which when present in both alleles of said animal results in muscular hyperplasia.

In another aspect, the invention is a method for determining the presence of
muscular hyperplasia in a bovine animal which includes obtaining a sample of the animal
containing mRNA and ascertaining whether an mRNA encoding a protein having biological
activity of myostatin is present in the sample. The absence of said mRNA indicates the presence
of muscular hyperplasia in the animal.

A sample containing mRNA can be muscle tissue, particularly skeletal muscle
tissue.

In a particular aspect, the invention is a method for determining the presence of
double muscling in a bovine animal, involving obtaining a sample of material containing DNA
from the animal and ascertaining whether the DNA contains the nucleotide sequence identified
as SEQ ID NO:11, in which the absence of the sequence indicates double muscling in the animal.

In a particular aspect, the animal is of the Belgian Blue breed.

In another aspect, the invention is a method for determining the myostatin
genotype of a mammal, as may be desirable to know for breeding purposes. The method
includes obtaining a sample of material containing nucleic acid of the mammal, wherein the
nucleic acid is uncontaminated by heterologous nucleic acid; ascertaining whether the sample
contains (i) a nucleic acid molecule encoding a protein having biological activity of myostatin; and
ascertaining whether the sample contains (ii) an allelic nucleic acid molecule encoding a protein
lacking biological activity of myostatin. The mammal can be bovine.

In another aspect, the subject is human and (i) includes a nucleic acid sequence 
substantially homologous (in the sense of identity) with the sequence identified as SEQ ID NO:7. 

The invention includes a method of increasing muscle mass of a mammal
having muscle cells in which myostatin is expressed, the method comprising administering to the
mammal an effective amount of a nucleic acid molecule substantially complementary to at least
a portion of mRNA encoding the myostatin and being of sufficient length to sufficiently reduce
expression of the myostatin to increase the muscle mass. In a particular aspect, the mammal is
bovine.

In another embodiment, the invention is a method of increasing muscle mass of
a mammal, including administering to the mammal an effective amount of a nucleic acid
molecule having ribozyme activity and a nucleotide sequence substantially complementary to at
least a portion of mRNA encoding myostatin and being of sufficient length to bind selectively
thereto to sufficiently reduce expression of the myostatin so as to increase the muscle mass.

The invention includes a diagnostic kit for determining the presence of muscular
hyperplasia in a mammal from which a sample containing DNA of the mammal has been
obtained. The kit includes first and second primers for amplifying the DNA, the primers being
complementary to nucleotide sequences of the DNA upstream and downstream, respectively, of
a mutation in the portion of the DNA encoding myostatin which results in muscular hyperplasia of
the mammal, wherein at least one of the nucleotide sequences is selected to be from a
non-coding region of the myostatin gene. The kit can also include a third primer complementary to a
naturally occurring mutation of a coding portion of the myostatin gene.

A particular diagnostic kit for determining the genotype of a sample of
mammalian genetic material, particularly bovine material, includes a pair of primers for amplifying
a portion of the genetic material corresponding to a nucleotide sequence which encodes at least
a portion of a myostatin protein, wherein a first of the primers includes a nucleotide sequence
sufficiently complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic acid
molecule containing the mutation, the mutation being selected from the group of mutations
resulting from: (a) deletion of 11 nucleotides beginning at nucleotide 821 of the coding portion of
SEQ ID NO:1; (b) deletion of 7 nucleotides beginning at nucleotide 419 of the coding sequence
and insertion of the sequence AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of
the coding sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the
coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 313 of the
coding sequence and insertion of A in place thereof; and combinations thereof. The second of
the pair of primers is preferably located entirely upstream or entirely downstream of the selected
mutation or mutations. In one kit, a first said primer spans mutation (a) and the kit further comprises a
third primer which is sufficiently complementary to the nucleotide sequence identified as SEQ ID
NO:11 to prime amplification of a nucleic acid molecule containing SEQ ID NO:11. In another (or
the same) kit, a first said primer is sufficiently complementary to the inserted sequence of
mutation (b) to prime amplification of a nucleic acid molecule containing mutation (b) and the kit further
comprises a third primer which is sufficiently complementary to the sequence corresponding to
the 7 nucleotide deletion of mutation (b) to prime amplification of a nucleic acid molecule
containing the 7 nucleotide deletion of mutation (b). In another (or the same) kit, a first said
primer spans mutation (c) and the kit further comprises a third primer which is sufficiently
complementary to the sequence spanning the corresponding region lacking mutation (c) to prime
amplification of a nucleic acid molecule lacking mutation (c). In another (or the same) kit, a first
said primer spans mutation (d) and the kit further comprises a third primer which is sufficiently
complementary to the sequence spanning the corresponding region lacking mutation (d) to
prime amplification of a nucleic acid molecule lacking mutation (d). In another (or the same) kit,
a first said primer spans mutation (e) and the kit further comprises a third primer which is sufficiently
complementary to the sequence spanning the corresponding region lacking mutation (e) to
prime amplification of a nucleic acid molecule lacking mutation (e).
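
Mutations (a) and (b) of the kit are both defined as a deletion at a stated position, optionally replaced by an inserted sequence, so they can be described with one data structure. The sketch below applies them to a synthetic placeholder and reports the net length change. The helper names and the placeholder are assumptions; the sequence is not SEQ ID NO:1, and the frame arithmetic shown says nothing by itself about where a stop codon arises in the real gene.

```python
def apply_delins(sequence, start, deleted, inserted=""):
    """Delete `deleted` bases from 1-based `start`, then insert `inserted` there."""
    index = start - 1
    return sequence[:index] + inserted + sequence[index + deleted:]

def shifts_reading_frame(deleted, inserted=""):
    """True when the net length change is not a multiple of three."""
    return (len(inserted) - deleted) % 3 != 0

# Positions, lengths and the inserted bases are taken from items (a) and (b).
KIT_INDELS = {
    "a": {"start": 821, "deleted": 11, "inserted": ""},
    "b": {"start": 419, "deleted": 7, "inserted": "AAGCATACAA"},
}

placeholder = "ACGT" * 300  # synthetic 1200-base stand-in, not SEQ ID NO:1
variants = {name: apply_delins(placeholder, **spec)
            for name, spec in KIT_INDELS.items()}
```

Mutation (a) removes 11 bases and therefore shifts the downstream reading frame. Mutation (b) replaces 7 bases with 10, a net gain of three bases, so the downstream frame is preserved and any disruption must come from the substituted bases themselves.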

The invention includes a purified protein having biological activity of myostatin,
and having an amino acid sequence identified as SEQ ID NO:2, or a conservatively substituted
variant thereof. The invention includes a purified bovine protein having biological activity of
myostatin or a purified human protein (SEQ ID NO:8) having biological activity of myostatin.

The invention includes an isolated nucleic acid molecule encoding a foregoing
protein. Particularly, the invention includes an isolated nucleic acid molecule comprising a DNA
molecule having the nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:3 or SEQ ID
NO:7 or which varies from the sequence due to the degeneracy of the genetic code, or a nucleic
acid strand capable of hybridizing with at least one said nucleic acid molecule under stringent
hybridization conditions.

The invention includes isolated mRNA transcribed from DNA having a sequence
which corresponds to a nucleic acid molecule of the invention.

The invention includes isolated DNA in a recombinant cloning vector and a
microbial cell containing and expressing heterologous DNA of the invention.

The invention includes a transfected cell line which expresses a protein of the
invention.

The invention includes a process for producing a protein of the invention,
including preparing a DNA fragment including a nucleotide sequence which encodes the protein;
incorporating the DNA fragment into an expression vector to obtain a recombinant DNA molecule
which includes the DNA fragment and is capable of undergoing replication;
transforming a host cell with the recombinant DNA molecule to produce a transformant which
can express the protein; culturing the transformant to produce the protein; and recovering the
protein from the resulting cultured mixture.

The invention includes a method of inhibiting myostatin so as to induce
increased muscle mass in a mammal, comprising administering an effective amount of an
antibody to myostatin to the mammal.

The invention includes a method of increasing muscle mass in a mammal, by
raising an autoantibody to the myostatin in the mammal. Raising the autoantibody can
include administering a protein having myostatin activity to the mammal.

The invention includes a method of increasing muscle mass in a mammal
including administering to the mammal an effective amount of an antisense nucleic acid or
oligonucleotide substantially complementary to at least a portion of the sequence identified as
SEQ ID NO:1 or SEQ ID NO:5 or SEQ ID NO:7. The portion can be at least 5 nucleotide bases
in length or longer. The mammal can be a bovine and the sequence can be that identified as
SEQ ID NO:1.

The invention includes a method of inhibiting production of myostatin in a 
mammal in need thereof, including administering to the mammal an effective amount of an 
antibody to the myostatin. 

The invention includes a probe containing a nucleic acid molecule sufficiently
complementary with a sequence identified as SEQ ID NO:1, or its complement, so as to bind
thereto under stringent conditions. The probe can be a sequence which is between about 8 and
about 1195 nucleotides in length.

The invention includes a primer composition useful for detection of the presence
of DNA encoding myostatin in cattle. The composition can include a nucleic acid primer
substantially complementary to a nucleic acid sequence encoding a bovine myostatin. The
nucleic acid sequence can be that identified as SEQ ID NO:1.

The invention includes a method for identifying a nucleotide sequence of a
mutant gene encoding a myostatin protein of a mammal displaying muscular hyperplasia. The
method includes obtaining a sample of material containing DNA from the mammal and probing
the sample using a nucleic acid probe based on a nucleotide sequence of a known gene
encoding myostatin in order to identify the nucleotide sequence of the mutant gene. In a particular
approach, the probe is based on a nucleotide sequence identified as SEQ ID NO:1, SEQ ID NO:5
or SEQ ID NO:7. Preferably, the probe is at least 8 nucleotides in length. The step of probing
the sample can include exposing the DNA to the probe under hybridizing conditions and further
comprising isolating hybridized nucleic acid molecules. The method can further include the step
of sequencing isolated DNA. The method can include the step of isolating and sequencing a
cDNA or mRNA encoding the complete mutant myostatin protein. The method can include a
step of isolating and sequencing a functional wild type myostatin from the mammal not displaying
muscular hyperplasia.

The method can include comparing the complete coding sequence of the
complete mutant myostatin protein with, if the coding sequence for a functional wild type
myostatin from such a mammal is previously known, (1) the known sequence, or if the coding
sequence for a functional wild type myostatin from such a mammal is previously unknown, (2)
the sequence determined according to claim 63 or claim 66, to determine the location of any
mutation in the mutant gene.

The invention includes a primer composition useful for the detection of a
nucleotide sequence encoding a myostatin containing a first nucleic acid molecule based on a
nucleotide sequence located upstream of a mutation determined according to a method of the
invention and a second nucleic acid molecule based on a nucleotide sequence located
downstream of the mutation.

A probe of the invention can include a nucleic acid molecule based on a 
nucleotide sequence spanning a mutation determined according to the invention. 

The invention includes an antibody to a protein encoded by a nucleotide
sequence identified as SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:7, or other protein of the
present invention.

The invention includes a transgenic bovine having a genome lacking a gene
encoding a protein having biological activity of myostatin; a transgenic mouse having a genome
containing a gene encoding a human protein having biological activity of myostatin or containing
a gene encoding a bovine protein having biological activity of myostatin; a transgenic bovine
having a gene encoding a bovine protein having biological activity of myostatin and heterologous
nucleotide sequence antisense to the gene. The transgenic bovine can include a gene encoding
a nucleic acid sequence having ribozyme activity and in transcriptional association with the
nucleotide sequence antisense to the gene.

The invention includes a transgenic mammal, usually non-human, having a
phenotype characterized by muscular hyperplasia, said phenotype being conferred by a
transgene contained in the somatic and germ cells of the mammal, the transgene encoding a
myostatin protein having a dominant negative mutation. The transgenic mammal can be male
and the transgene can be located on the Y chromosome. The mammal can be bovine and the
transgene can be located to be under the control of a promoter which is normally a promoter of a
myosin gene.

Another transgenic mammal, usually non-human, of the invention has a
phenotype characterized by muscular hyperplasia, in which the phenotype is conferred by a
transgene having a sequence antisense to that encoding a myostatin protein of the mammal.
The mammal can be a male bovine and the transgene can be located on the Y chromosome.
The transgene can further include a sequence which when transcribed yields an mRNA having
ribozyme activity.

A transgenic non-human mammal of the invention having a phenotype
characterized by muscular hyperplasia can have the phenotype inducible and conferred by a
myostatin gene flanked by loxP sites and a Cre transgene under the dependence of an
inducible promoter.

A transgenic non-human male mammal of the invention having a phenotype
characterized by muscular hyperplasia can have the phenotype conferred by a myostatin gene
flanked by loxP sites and a Cre transgene located on the Y chromosome.

The invention includes a method for determining whether a sample of
mammalian genetic material is capable of conferring a phenotype characterized by muscular
hyperplasia, comprising ascertaining whether the genetic material contains a nucleotide
sequence encoding a protein having biological activity of myostatin, wherein the absence of said
sequence indicates the presence of muscular hyperplasia in the animal.

Brief Description Of Drawings 

In describing particular aspects of the invention, reference is made to the
accompanying drawings, in which:

Figure 1 is a schematic summary of genetic, physical and comparative mapping
information around the bovine mh locus. A multi-point lodscore curve obtained for the mh locus
with respect to the microsatellite marker map is shown. Markers that were not informative in the
pedigree used are shown between brackets; their map position is inferred from published
mapping data. Markers and the YACs from which they were isolated are connected by arrows.
The RH-map of the relevant section of human HSA2 is shown, with the relative position in cR of
the ESTs used. Stippled lines connect microsatellite and Type I markers with their respective
positive YACs. YACs showing cross-hybridizing SINE-PCR products are connected by the red
boxes.

Figure 2(a) shows electropherograms obtained by cycle-sequencing the
myostatin cDNA sequence from a double-muscled and a conventional animal, showing the
nt821del(11) deletion (SEQ ID NO:11) in the double-muscled animal. The primers used to
amplify the fragment encompassing the deletion from genomic DNA are spaced apart from the
remaining nucleotides.

Figure 2(b) shows the amino-acid sequence of the murine (top row), bovine
normal (middle row) and bovine nt821del(11) (bottom row) allele. The putative site of proteolytic
processing is boxed, while the nine conserved cysteines in the carboxy-terminal region are
underlined. The differences between the normal and nt821del(11) bovine allele are indicated by
the double underlining.

Figure 3 is a schematic representation of the bovine myostatin gene with position
and definition of the identified DNA sequence polymorphisms. The "A" (clear) boxes correspond
to the untranslated leader and trailer sequences (large diameter), and the intronic sequences
(small diameter) respectively. The "B", "C", and "D" boxes correspond to the sequences coding
for the leader peptide, N-terminal latency-associated peptide and bioactive carboxyterminal
domain of the protein respectively. Small "e", "f" and "g" arrows point towards the positions of the
primers used for intron amplification, exon amplification and sequencing and exon sequencing
respectively; the corresponding primer sequences are reported in Table 1. The positions of the
identified DNA sequence polymorphisms are shown as "h", "i" or "j" lines on the myostatin gene
for silent, conservative and disrupting mutations respectively. Each mutation is connected via an
arrow with a box reporting the details of the corresponding DNA sequence and, where applicable,
the encoded peptide sequence. In each box, the variant sequence is compared with the control
Holstein-Friesian sequence and differences are highlighted in color.

Figure 4 shows the distribution of identified mutations in the various breeds
examined. The order of the myostatin mutations corresponds to Figure 3. All analyzed animals
were double-muscled except for the two Holstein-Friesian and two Jerseys used as controls
(column 1).

Detailed Description Of Preferred Embodiments 

The method used for isolating genes which cause specific phenotypes is known
as positional candidate cloning. It involves: (i) the chromosomal localization of the gene which
causes the specific phenotype using genetic markers in a linkage analysis; and (ii) the
identification of the gene which causes the specific phenotype amongst the "candidate" genes
known to be located in the corresponding region. Most of the time these candidate genes are
selected from available mapping information in humans and mice.

The tools required to perform the initial localization (step (i) above) are
microsatellite marker maps, which are available for livestock species and are found in the public
domain (Bishop et al., 1994; Barendse et al., 1994; Georges et al., 1995; and Kappes, 1997).
The tools required for the positional candidate cloning (step (ii) above), particularly the YAC
libraries, are partially available from the public domain. Genomic libraries with large inserts
constructed with Yeast Artificial Chromosomes ("YAC") are available in the public domain for
most livestock species including cattle. When cross-referencing the human and mouse maps to
identify the positional candidate, the comparative information is available only at low resolution
and needs to be refined in each specific instance to reach the required resolution. A number of
original strategies are described herein to achieve this latter objective. For general principles of
positional candidate cloning, see Collins, 1995 and Georges and Andersson, 1996.

In order to allow for cross-referencing between the bovine and human gene
map as part of the positional candidate cloning approach, HSA2q31-32 (map of the long arm of
human chromosome 2, cytogenetic bands q31-32) and BTA2q12-22 (map of the arm of bovine
chromosome 2, cytogenetic bands q12-22) were integrated on the basis of coincident bovine
YACs as described below.

Using a previously described experimental [(normal x double-muscled) x double-muscled]
backcross population comprising 108 backcross individuals, the mh locus was recently
mapped by linkage analysis to the centromeric tip of bovine chromosome 2 (BTA2), at 3.1
centiMorgan proximal from the last marker on the linkage map: TGLA44 (Charlier et al., 1995). It
was also known from previous work that pro-α(III) collagen (Col3A1) was located in the same
chromosomal region as the mh locus. Col3A1 has been mapped to BTA2q12-22 by in situ
hybridization (Solinas-Toldo et al., 1995), while a Col3A1 RFLP marker was shown to be closely
linked to TGLA44 (θ=2%) (Fisher et al., 1996). This identifies the region flanking Col3A1 on the
human map, i.e. HSA2q31-32, as the likely orthologous human chromosome segment. This
assumption is compatible with data from Zoo-FISH experiments (Solinas-Toldo et al., 1995) as
well as mapping data of Type I markers on somatic cell hybrids (O'Brien et al., 1993), which
establish an evolutionary correspondence between segments of HSA2q and BTA2.

In order to refine the correspondence between the HSA2q31-33 and BTA2q12-22
maps, Comparative Anchored Tagged Sequences or CATS, i.e. primer pairs that would
amplify a Sequence Tagged Site or STS from the orthologous gene in different species (Lyons et
al., 1996), were developed for a series of genes flanking Col3A1 on the human map and for
which sequence information was available in more than one mammal. In addition to Col3A1,
working CATS were obtained for α2(V) collagen (Col5A2), inositol polyphosphate-1 phosphatase
(INPP1), tissue factor pathway inhibitor precursor (TFPI), titin (TTN), n-chimaerin (CHN),
glutamate decarboxylase 67 (GAD1), Cytotoxic T-lymphocyte-associated protein 4 (CTLA4) and
T-cell membrane glycoprotein CD28 (CD28). The corresponding primer sequences are given in
Table 1.

Table 1:

CATS

INPP1      UP: 5' CAGCAAAGTCCTTAATGGTAACAAGC 3'   DN: 5' GGGTCACTGAAGAAAACGTCCTG 3'
COL3A1     UP: 5' CCCCATATTATGGAGATGAACCG 3'      DN: 5' AGTTCAGGATGGCAGAATTTCAG 3'
COL5A2     UP: 5' GCAAACTGGGYGGRAGCAAGACC 3'      DN: 5' TTSTTCCTGGGCTTTTATTGAGAC 3'
TFPI       UP: 5' AAGCCWGATTTCTGCTTYTTGGAAG 3'    DN: 5' TGCCMAGGCAHCCRCCRTACTTGAA 3'
TTN        UP: 5' GGTCGTCCTACACCAGAAG 3'          DN: 5' GGTTGACATTGTCAAGAACAAG 3'
CHN        UP: 5' TCTCMAAAGTCGTCTGTGACAATC 3'     DN: 5' TGYTCRTTTTCTTTCAGAGTTGC 3'
GAD1       UP: 5' RCTGGTCCTCTTCACCTCAGAAC 3'      DN: 5' ACATTGTCVGTTCCAAAGCCAAG 3'
CTLA4      UP: 5' AGGTYCGGGTGACDGTGCTKC 3'        DN: 5' TGGRTACATGAGYTCCACCTTGC 3'
CD28       UP: 5' AGCTGCARGTATWCCTACAAYCT 3'      DN: 5' GTYCCRTTGCTCYTCTCRTTGYC 3'

Microsatellite markers

TGLA44     UP: 5' AACTGTATATTGAGAGCCTACCATG 3'    DN: 5' CACACCTTAGCGACTAAACCACCA 3'
BULGE27    UP: 5' CTACCTAACAGAATGATTTTGTAAG 3'    DN: 5' AGTGTTCTTGCCTAGAGAATCCCAG 3'
BULGE23    UP: 5' ACATTCTCTCACCAATATGACATAC 3'    DN: 5' TAAGTCACCATTACATCCTTAGAAC 3'
BM81124    UP: 5' GCTGTAAGAATCTTCATTAAGCACT 3'    DN: 5' CCTGATACATGCTAAGGTTAAAAAC 3'
BULGE28    UP: 5' AGGCATACATCTGGAGAGAAACATG 3'    DN: 5' CAGAGGAGCCTAGCAGGCTACCGTC 3'
BULGE20    UP: 5' CAGCAGGTCTGTTGAAGTGTATCAG 3'    DN: 5' AGTGGTAGCATTCACAGGTAGCCAG 3'
BM3627     UP: 5' CAGTCCATGGCACCATAAAG 3'         DN: 5' TCCGTTAGTACTGGCTAATTGC 3'
ILSTS026   UP: 5' CTGAATTGGCTCCAAAGGCC 3'         DN: 5' AAACAGAAGTCCAGGGCTGC 3'
INRA40     UP: 5' TCAGTCTCCAGGAGAGAAAAC 3'        DN: 5' CTCTGCCCTGGGGATGATTG 3'

Bovine Myostatin primers

GDF8.19    5' AATGTATGTTTATATTTACCTGTTCATG 3'
GDF8.11    5' ACAGTGTTTGTGCAAATCCTGAGAC 3'
GDF8.12    5' CAATGCCTAAGTTGGATTCAGGTTG 3'
GDF8.25    5' CTTGCTGTAACCTTCCCAGGACCAG 3'
GDF8.15    5' TCCCATCCAAAGGCTTCAAAATC 3'
GDF8.21    5' ATACTCWAGGCCTAYAGCCTGTGGT 3'

Reading from left to right and down the table, the sequences given in Table 1 are identified as
SEQ ID NO:12 to SEQ ID NO:53, respectively.
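Several of the CATS primers in Table 1 contain IUPAC ambiguity codes (Y, R, W, S, M, K, H, D, V), so each written primer stands for a small pool of oligonucleotides. The following is a minimal sketch, not part of the original disclosure, of how the size of that pool can be counted and enumerated; the function names `degeneracy` and `expand` are illustrative assumptions, and the example sequence is the COL5A2 UP primer as listed in Table 1.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes and the bases each one represents.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct oligonucleotides represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every unambiguous sequence encoded by a degenerate primer."""
    pools = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*pools)]


# COL5A2 UP primer from Table 1 (contains one Y and one R).
col5a2_up = "GCAAACTGGGYGGRAGCAAGACC"
print(degeneracy(col5a2_up))
for variant in expand(col5a2_up):
    print(variant)
```

Enumerating the pool is only practical for low degeneracy; for highly degenerate primers the count alone is usually the more useful figure.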



These CATS were used to screen a 6-genome equivalent bovine YAC library by
PCR using a three-dimensional pooling strategy as described by Libert et al., 1993. The same
YAC library was also screened with all microsatellite markers available for proximal BTA2, i.e.
TGLA44, BM81124, BM3627, ILSTS026, INRA40 and TGLA431 (Kappes et al., 1997).

Potential overlap between the YACs obtained with this panel of STSs was
explored on the basis of common STS content, as well as cross-hybridization between SINE-PCR
products from individual YACs. From this analysis, three independent YAC contigs emerged
in the region of interest: (i) contig A containing microsatellites TGLA44, BM81124 and Type I
marker INPP1; (ii) contig B containing Col3A1 and Col5A2; and (iii) contig C containing
microsatellite markers BM3627, ILSTS026 and INRA40, and Type I marker TFPI.

None of the available microsatellites mapped to contig B, therefore this cluster
of YACs could not be positioned in cattle with respect to the two other contigs. Available
mapping information in the human, however, allowed prediction of contig B's position between
contigs A and C. To test this hypothesis, two new microsatellite markers were isolated from
contig B, BULGE20 and BULGE28. BULGE20 proved to be polymorphic, allowing for
genotyping of the experimental backcross population.

In addition, to increase the informativeness of the markers available for contig A,
two new microsatellite markers were developed from this contig: BULGE23 and BULGE27.
BULGE23 proved to be polymorphic and was used to type the same pedigree material.

All resulting genotypes were used to construct a linkage map using the ILINK
program (Lathrop and Lalouel, 1984). The following most likely order and sex-averaged
recombination rates between adjacent markers were obtained: [TGLA44-(0%)-BULGE23]-(6.1%)-
BULGE20-(1.6%)-ILSTS026-(2.3%)-INRA40-(7.1%)-TGLA431. The position of BULGE20 between
TGLA44 and ILSTS026 confirmed the anticipated order of the three contigs. Figure 1
summarizes the resulting mapping information.
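To visualise the marker order reported above as cumulative map positions, the recombination rates can be converted into additive map distances. The sketch below assumes Haldane's mapping function, which is an illustrative choice and is not stated in the text (the LINKAGE programs may have been run with different settings); the marker names and recombination rates are those reported above, and `haldane_cm` is a hypothetical helper name.

```python
import math


def haldane_cm(theta):
    """Haldane map distance in centiMorgans for a recombination fraction theta."""
    if not 0.0 <= theta < 0.5:
        raise ValueError("recombination fraction must be in [0, 0.5)")
    return -50.0 * math.log(1.0 - 2.0 * theta)


# Adjacent-marker recombination fractions as reported for the backcross map.
INTERVALS = [
    ("TGLA44", "BULGE23", 0.0),
    ("BULGE23", "BULGE20", 0.061),
    ("BULGE20", "ILSTS026", 0.016),
    ("ILSTS026", "INRA40", 0.023),
    ("INRA40", "TGLA431", 0.071),
]

# Accumulate distances from the first marker, which is placed at 0 cM.
position = 0.0
print(f"{'TGLA44':<10s}{position:7.2f} cM")
for _left, right, theta in INTERVALS:
    position += haldane_cm(theta)
    print(f"{right:<10s}{position:7.2f} cM")
```

For the small recombination fractions involved here the map distance in cM stays close to the recombination rate in percent, so the choice of mapping function has little practical effect on the picture.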

A multipoint linkage analysis was undertaken using LINKMAP, to position the
mh locus with respect to the new marker map. Linkage analysis was performed under a simple
recessive model, assuming full penetrance for mh/mh individuals and zero penetrance for the
two other genotypes. The LOD score curve shown in Figure 1 was obtained, placing the mh locus
in the TGLA44-BULGE20 interval with an associated maximum LOD score of 26.4. Three
backcross individuals were shown to recombine with BULGE20 and the distal markers, but not
with TGLA44 and BULGE23, therefore placing the mh locus proximal from this marker. One
individual was shown to recombine with TGLA44 and BULGE23, but not with the more distal
markers, therefore positioning the mh locus distal from TGLA44 and BULGE23. Given the
relative position of these microsatellite markers with respect to INPP1 and Col3A1 as deduced
from the integration of the human and bovine map, these results indicated that the mh gene is
likely located in a chromosome segment bounded by INPP1 and Col3A1.
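For readers unfamiliar with LOD scores, the following sketch shows the simplest possible case: a two-point LOD score for a fully informative, phase-known backcross, where each offspring is scored as recombinant or non-recombinant. This is a simplification made for illustration only; it is not the multipoint LINKMAP computation used above and is not expected to reproduce the reported value of 26.4. The function name and the example counts (3 recombinants among 108 offspring) are assumptions chosen to echo the numbers in the text.

```python
import math


def two_point_lod(n_meioses, n_recombinants, theta):
    """LOD score at recombination fraction theta versus free recombination (0.5).

    Assumes a phase-known backcross in which every meiosis is informative.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must be in [0, 0.5]")
    if not 0 <= n_recombinants <= n_meioses:
        raise ValueError("recombinant count must be between 0 and n_meioses")

    def count_times_log10(count, prob):
        # Uses the convention 0 * log10(0) = 0 so that theta = 0 is allowed
        # when no recombinants were observed.
        if count == 0:
            return 0.0
        if prob == 0.0:
            return float("-inf")
        return count * math.log10(prob)

    non_recombinants = n_meioses - n_recombinants
    return (
        count_times_log10(n_recombinants, theta)
        + count_times_log10(non_recombinants, 1.0 - theta)
        + n_meioses * math.log10(2.0)
    )


# Illustrative input: 3 recombinants among 108 informative backcross offspring.
n, k = 108, 3
theta_hat = k / n  # maximum-likelihood estimate of the recombination fraction
print(f"theta = {theta_hat:.4f}, LOD = {two_point_lod(n, k, theta_hat):.2f}")
```

A LOD score above 3 is the conventional threshold for declaring linkage, which is also the criterion quoted for the radiation hybrid placement later in the text.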

Recently, McPherron et al. (1997) demonstrated that mice homozygous for a
knock-out deletion of GDF-8, or myostatin, were characterized by a generalized increase in
skeletal muscle mass. Using the published 2676 bp murine myostatin cDNA sequence
(GenBank accession number U84005), a Tentative Human Consensus (THC) cluster in the
Unigene database was identified which represented three cDNA clones (221299, 300367,
308202) and six EST (Expressed Sequence Tag) sequences (H92027, H92028, N80248,
N95327, W07375, W24782). The corresponding THC covered 1296 bp of the human myostatin
gene, showing a homology of 78.1% with the murine sequence when averaged over the entire
sequence, and 91.1% when considering only the translated parts of the human and murine
genes (566 bp). This THC therefore very likely corresponds to the human orthologue of the
murine myostatin gene. Primers (5'-GGCCCAACTATGGATATATTTG-3' (SEQ ID NO:9) and
5'-GGTCCTGGGAAGGTTACAGCA-3' (SEQ ID NO:10)) were thus prepared to amplify a 272 bp
fragment from the second exon of human myostatin and used to genotype the whole-genome
Genebridge-4 radiation hybrid panel (Walter et al., 1994). The RHMapper program (Slonim et
al., unpublished) was used to position the myostatin gene with respect to the Whitehead/MIT
framework radiation hybrid map, placing it at position 948.7 cR of the HSA2 map with an
associated LOD score > 3. Closer examination of the myostatin segregation vector and its
comparison with the vectors from all markers located in that region (Data Release 11.9, May
1997) showed it to be identical to EST SGC38239 placed on the Whitehead/MIT radiation hybrid
map (Hudson et al., 1995) at position 946.8 cR of HSA2. This places the human myostatin gene
on the RH-map in the interval between Col3A1 (EST WI16343 - 942.5 cR) and INPP1 (EST
L08488 - 950.2 to 951.2 cR) (Figure 1). Myostatin therefore appeared as a very strong positional
candidate for the mh gene.

To test the potential involvement of myostatin in the determinism of double-muscling
in cattle, primer pairs were designed based on the available mouse and human
myostatin sequence, with the objective of amplifying the entire coding sequence from bovine cDNA
using PCR (Polymerase Chain Reaction). Whenever possible, primers were therefore positioned
in portions of the myostatin sequence showing 100% homology between mouse and human.
Two primer pairs were identified that amplified what was predicted to represent 98.4% of the
bovine coding sequence plus 74 bp of 3' untranslated sequence, in two overlapping DNA
fragments, respectively 660 (primers GDF8.19 - GDF8.12) and 724 bp (primers GDF8.11 -
GDF8.21) long. The expected DNA products were successfully amplified from cDNA
generated from skeletal muscle of both a normal (homozygous +/+) (SEQ ID NO:1) and a
double-muscled (homozygous mh/mh) (SEQ ID NO:3) animal, and cycle-sequenced on both
strands.

The nucleotide sequence corresponding to the normal allele presented 88.9%
identity with the mouse myostatin sequence (SEQ ID NO:5) over a 1067 bp overlap, and
contained the expected open reading frame encoding a protein (SEQ ID NO:2) showing 92.9%
identity in a 354 amino-acid overlap with mouse myostatin (SEQ ID NO:6). As expected for a
member of the TGF-β superfamily, the bovine myostatin gene is characterized by a proteolytic
processing site thought to mediate cleavage of the bioactive carboxy-terminal domain from the
longer N-terminal fragment, and by nine cysteine residues separated by a characteristic spacing
and suspected to be involved in intra- and inter-molecular disulfide bridges (McPherron and Lee,
1996).

The nucleotide sequence obtained from the mh allele was identical to the
normal allele over its entire length, except for an 11 bp deletion involving nucleotides 821 to 831
(counting from the initiation codon). This frame-shifting deletion, occurring after the first cysteine
residue of the carboxy-terminal domain, drastically disrupts the downstream amino-acid
sequence and reveals a premature stop codon after 13 amino acids (see Figure 2). The amino
acid sequence encoded by the mutant nucleic acid sequence is identified as SEQ ID NO:4. This
mutation disrupts the bioactive part of the molecule and is therefore very likely to be the cause of
the recessive double-muscling phenotype. Following conventional nomenclature, this mutation
will be referred to as nt821(del11).

To further strengthen the assumption of the causality of this mutation, primer
pairs flanking the deletion (Figure 2) were prepared and the corresponding DNA segment was
amplified from all animals of the experimental backcross population. PCR was performed in the
presence of 32P-labelled dCTP in order to radioactively label the amplification product.
Amplification products were separated on denaturing polyacrylamide gels and detected by
autoradiography. A 188 bp product would be expected for the normal allele and a 177 bp product
for the nt821(del11) allele. Phenotype and genotype matched for the entire pedigree. All ten BBCB
double-muscled sires were found to be homozygous for the nt821(del11) mutation and all 41 F1
females were heterozygous, while the 53 double-muscled offspring were homozygous for the
mutation and the remaining 55 conventional animals were heterozygous.
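Under the recessive model, a backcross of heterozygous F1 females to homozygous double-muscled sires is expected to yield homozygous and heterozygous offspring in a 1:1 ratio. As a brief illustration, not part of the original analysis, the observed split of 53 to 55 can be checked with a one-degree-of-freedom chi-square goodness-of-fit statistic; the function name is an assumption, and 3.841 is the standard 5% critical value for one degree of freedom.

```python
def chi_square_one_to_one(count_a, count_b):
    """Chi-square goodness-of-fit statistic for an expected 1:1 segregation."""
    total = count_a + count_b
    if total == 0:
        raise ValueError("at least one observation is required")
    expected = total / 2.0
    return ((count_a - expected) ** 2 + (count_b - expected) ** 2) / expected


# Observed backcross offspring: 53 homozygous mutant, 55 heterozygous.
statistic = chi_square_one_to_one(53, 55)
CRITICAL_5_PERCENT_1_DF = 3.841  # chi-square critical value, 1 degree of freedom
print(f"chi-square = {statistic:.3f}")
print("consistent with 1:1 at the 5% level:", statistic < CRITICAL_5_PERCENT_1_DF)
```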

To examine the distribution of the nt821(del11) mutation in different conventional
and double-muscled breeds, a cohort of 25 normal individuals was genotyped representing two
dairy breeds (Holstein-Friesian, Red-and-White) and a cohort of 52 double-muscled animals
representing four breeds (BBCB, Asturiana, Maine-Anjou and Piemontese). The results are
summarized in Table 2. All dairy animals were homozygous normal except for one Red-and-White
bull shown to be heterozygous. The occurrence of a small fraction of individuals carrying
the mutation in dairy cattle is not unexpected as the phenotype is occasionally described in this
breed. In BBCB and Asturiana, all double-muscled animals were homozygous for the
nt821(del11) deletion, pointing towards allelic homogeneity in these two breeds. Double-muscled
Maine-Anjou and Piemontese animals were homozygous "normal", i.e. they did not
show the nt821(del11) deletion. However, a distinct cysteine to tyrosine substitution (C313Y),
also identified by others (Kambadur et al., 1997), was discovered in double-muscled
Piedmontese animals.



Table 2:

Breed               Phenotype   Genotype
                                +/+    +/nt821(del11)    nt821(del11)/nt821(del11)

Belgian Blue        DM                                   29
Asturiana           DM                                   10
Piemontese          DM          8
Maine-Anjou         DM          4
Holstein-Friesian   Normal      13
Red-and-White       Normal      12     1


The entire coding sequence was also determined for the myostatin gene in
double-muscled individuals from ten European cattle breeds and a series of mutations that
disrupt myostatin function were identified.

The coding sequence of four control Holstein-Friesian and Jersey individuals
was identical to the previously described wild-type allele (Grobet et al., 1997), further indicating
that it was the genuine myostatin coding sequence being amplified, and not a non-functional
pseudogene.

Amongst the 32 double-muscled animals, seven DNA sequence variants within
the coding region were found, as summarized in Figure 3.

In addition to the nt821(del11) mutation in the third exon, described above, four
new mutations that would be expected to disrupt the myostatin function were found. An
insertion/deletion at position 419 counting from the initiation codon, replacing 7 base pairs with an
apparently unrelated stretch of 10 base pairs, reveals a premature stop codon in the N-terminal
latency-associated peptide at amino-acid position 140. This mutation is referred to as
nt419(del7-ins10). Two base pair substitutions in the second exon, a C→T transition at
nucleotide position 610 and a G→T transversion at nucleotide position 676, each yield a
premature stop codon in the same N-terminal latency-associated peptide at amino-acid positions
204 and 226 respectively. These mutations are called Q204X and E226X respectively. Finally, a
G→A transition at nucleotide position 938 results in the substitution of a cysteine by a tyrosine.
This mutation is referred to as C313Y. This cysteine is the fifth of nine highly conserved cysteine
residues typical of the members of the TGF-β superfamily and shared in particular by TGF-β1,
-β2 and -β3, and inhibin-βA and -βB (McPherron & Lee, 1996). It is thought to be involved in an
intramolecular disulfide bridge stabilizing the three-dimensional conformation of the bioactive
carboxyterminal peptide. Its substitution is therefore likely to affect the structure and function of
the protein. This C313Y has recently also been described by Kambadur et al. (1997).

A conservative phenylalanine to leucine substitution was also found at amino-acid
position 94 in the first exon, due to a C→A transversion at nucleotide position 282 of the
myostatin gene. Given the conservative nature of the amino-acid substitution, its location in the
less conserved N-terminal latency-associated peptide, and as this mutation was observed in the
homozygous state in animals that were not showing any sign of exceptional muscular
development, this mutation probably does not interfere drastically with the myostatic function of
the encoded protein, if at all. This mutation is referred to as F94L. The murine protein is
characterized by a tyrosine at the corresponding amino-acid position.

Also identified was a silent C→T transition at the third position of the 138th
codon (a cysteine codon) in the second exon, referred to as nt414(C-T).
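The nucleotide and amino-acid positions quoted for the coding-region variants can be cross-checked with simple arithmetic, since nucleotide positions are counted from the initiation codon. The sketch below is illustrative only: `codon_of` and `shifts_frame` are hypothetical helper names, and the positions are those given in the text.

```python
def codon_of(nt_position):
    """Map a 1-based coding nucleotide position to (codon number, position in codon)."""
    if nt_position < 1:
        raise ValueError("nucleotide positions are counted from 1")
    zero_based = nt_position - 1
    return zero_based // 3 + 1, zero_based % 3 + 1


def shifts_frame(deleted, inserted=0):
    """True if the net length change is not a multiple of three."""
    return (inserted - deleted) % 3 != 0


# Variant names and nucleotide positions as reported in the text.
VARIANTS = [
    ("F94L", 282),
    ("nt414(C-T)", 414),
    ("nt419(del7-ins10)", 419),
    ("Q204X", 610),
    ("E226X", 676),
    ("nt821(del11)", 821),
    ("C313Y", 938),
]

for name, nt in VARIANTS:
    codon, offset = codon_of(nt)
    print(f"{name:<18s} nt {nt:>3d} -> codon {codon:>3d}, position {offset} in codon")

print("nt821(del11) shifts the reading frame:", shifts_frame(deleted=11))
print("nt419(del7-ins10) shifts the reading frame:", shifts_frame(deleted=7, inserted=10))
```

Under this arithmetic the 11 bp deletion changes the reading frame, whereas the replacement of 7 bp by 10 bp has a net change of three bases and does not, which is consistent with the stop codon for nt419(del7-ins10) being reported at codon 140, the codon containing nucleotide 419.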

In addition to these DNA sequence polymorphisms detected in the coding region
of the myostatin gene, also found were four DNA sequence variants in intronic sequences which
are probably neutral polymorphisms and which have been assigned the following symbols:
nt374-51(T-C), nt374-50(G-A), nt374-16(del1) in intron 1, and nt748-78(del1) in intron 2 (Figure
3).

Figure 4 shows the observed distribution of mutations in the analysed sample
sorted by breed. For the majority of the studied breeds, the analyzed double-muscled animals
were homozygous for one of the five described mutations expected to disrupt the myostatin
function or compound heterozygotes for two of these mutations. This is compatible with the
hypothesis that the double-muscled condition has a recessive mode of inheritance in all these
breeds.

Only in Limousin and Blonde d'Aquitaine was there no clear evidence for the
role of myostatin loss-of-function mutations in the determinism of the observed muscular
hypertrophy. Most Limousin animals were homozygous for the conservative F94L substitution
which is unlikely to cause the muscular hypertrophy characterizing these animals, as discussed
above. One Limousin animal proved to be heterozygous for this mutation, the other allele being
the "wild-type" one. All Blonde d'Aquitaine animals were homozygous "wild-type". These data
indicate either that the myostatin gene is possibly not involved in the double-muscled condition
characterizing these two breeds, or that there are additional myostatin mutations outside of the
coding region. The double-muscling condition is often considered to be less pronounced in
Limousin animals compared to other breeds.

The data indicate that some mutations, such as nt821(del11) and C313Y,
are shared by several breeds, which points towards gene migration between the corresponding
populations, while others seem to be confined to specific breeds. Moreover, while some breeds
(the Belgian Blue breed in particular) seem to be essentially genetically homogeneous, others
show clear evidence for allelic heterogeneity (e.g. Maine-Anjou).

The observation of allelic heterogeneity contradicts the classical view that a
single mh mutation spread through the European continent in the beginning of the 19th century
with the dissemination of the Shorthorn breed from the British Isles (Menissier, 1982). Two of
the mutations at least are shared by more than one breed, indicating some degree of gene
migration but definitely not from a single origin.

In mice, in addition to the in vitro generated myostatin knock-out mice
(McPherron & Lee, 1997), the compact mutation could be due to a naturally occurring mutation
at the myostatin gene. The compact locus has been mapped to the D1Mit375-D1Mit21 interval
on mouse chromosome 1 known to be orthologous to HSA2q31-32 and BTA2q12-22 (Varga et
al., 1997).

From an applied point of view, the characterisation of a panel of mutations in
the myostatin gene associated with double-muscling contributes to the establishment of a
diagnostic screening system allowing for marker assisted selection for or against this condition in
cattle.

Example 1
Genetic and physical mapping

Integration of the HSA2q31-32 and BTA2q12-22 maps was done by using
coincident YACs and the mh locus was positioned in the interval flanked by Col3A1 and INPP1 as
follows. Genetic mapping was performed using a previously described (Holstein-Friesian x
Belgian Blue) x Belgian Blue experimental backcross population counting 108 informative
individuals (Charlier et al., 1995). Microsatellite genotyping was performed according to standard
procedures (Georges et al., 1995), using the primer sequences reported in Table 1. Linkage
analyses were performed with the MLINK, ILINK and LINKMAP programs of the LINKAGE
(version 5.1) and FASTLINK (2.3P version, June 1995) packages (Lathrop & Lalouel, 1984;
Cottingham et al., 1993). The YAC library was screened by PCR using a three-dimensional
pooling scheme as described in Libert et al., 1993. The primer pairs corresponding to the CATS
used to screen the library are reported in Table 1. Cross-hybridisation between SINE-PCR
products of individual YACs was performed according to Hunter et al. (1996), using primers
reported in Lenstra et al. (1993). Microsatellites were isolated from YACs according to Cornelis
et al. (1992).

Example 2
Mapping of the human myostatin gene on the Genebridge-4 panel

DNA from the Genebridge-4 panel (Walter et al., 1994) was purchased from
Research Genetics (Huntsville, Alabama), and genotyped by PCR using standard procedures
and the following human myostatin primer pair (5'-GGCCCAACTATGGATATATTTG-3' and
5'-GGTCCTGGGAAGGTTACAGCA-3'). Mapping was performed via the WWW server of the
Whitehead Institute/MIT Center for Genome Research using their RHMapper program (Slonim,
D.; Stein, L.; Kruglyak, L.; Lander, E., unpublished) to position the markers with respect to the
framework map. Segregation vectors of the query markers were compared with the vectors from
all markers in the region of interest in the complete Data Release 11.9 (May 1997) to obtain a
more precise position. This positions myostatin in the INPP1-Col3A1 interval on the human map
with a LOD score greater than 3.
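The comparison of segregation vectors mentioned above amounts to checking, hybrid by hybrid, whether two markers are retained in the same cell lines. The following is a minimal sketch of such a comparison under stated assumptions: vectors are written as strings with "1" for retained, "0" for not retained and "2" for an ambiguous or untyped hybrid (a common convention, assumed here), and the two short vectors are invented toy data, not Genebridge-4 results.

```python
def compare_vectors(vector_a, vector_b, unknown="2"):
    """Return (hybrids scored in both vectors, number of discordant hybrids)."""
    if len(vector_a) != len(vector_b):
        raise ValueError("vectors must cover the same hybrid panel")
    scored = 0
    discordant = 0
    for call_a, call_b in zip(vector_a, vector_b):
        # Skip hybrids that are ambiguous or untyped for either marker.
        if call_a == unknown or call_b == unknown:
            continue
        scored += 1
        if call_a != call_b:
            discordant += 1
    return scored, discordant


# Hypothetical toy vectors for a query marker and a reference marker.
query = "0101201"
reference = "0101001"
scored, discordant = compare_vectors(query, reference)
print(f"{scored} hybrids scored, {discordant} discordant")
```

Two markers with no discordant hybrids across the panel cannot be separated by the radiation hybrid data, which is the sense in which a query vector is described as identical to that of a mapped EST.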

Example 3
RT-PCR

To clone the bovine myostatin orthologue a strategy based on RT-PCR
amplification from skeletal muscle cDNA was chosen. Total RNA was extracted from skeletal
muscle (Triceps brachialis) according to Chirgwin et al. (1979). RT-PCR was performed using
the Gene-Amp RNA PCR Kit (Perkin-Elmer) and the primers reported in Table 1. The PCR
products were purified using QiaQuick PCR Purification kit (Qiagen) and sequenced using Dye
terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic
sequencer, using the primers reported in Table 2.

Example 4
Diagnosis of the nt821(del11) deletion

To diagnose the nt821(del11) deletion, the following primer sequences were designed
flanking the deletion: 5'-TCTAGGAGAGATTTTGGGCTT-3' (SEQ ID NO:53) and
5'-GATGGGTATGAGGATACTTTTGC-3' (SEQ ID NO:52). These primers amplify a 188 bp
fragment from normal individuals and a 177 bp fragment from double-muscled individuals.
Heterozygous individuals show the two amplification products. These amplification products can
be detected using a variety of methods. In this example the PCR product was labelled by
incorporation of 32P-labelled dCTP, separated on a denaturing acrylamide gel and revealed by
autoradiography. Other approaches that could be used to distinguish the three different
genotypes are known to those skilled in the art and would include separation in agarose gels and
visualization with ethidium bromide, direct sequencing, TaqMan assays, hybridization with
allele-specific oligonucleotides, reverse dot-blot, RFLP analysis and several others. The
specificity of the test is linked to the detected mutation and not to the primers used in the
detection method. This means that other primers fulfilling the same requirements can easily be
designed based on said bovine myostatin sequence.
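The genotype call implied by the two fragment sizes can be written down as a small decision rule. The sketch below is an illustration only: the function name, the "no call" outcome and the sizing tolerance of 2 bp are assumptions introduced here, while the 188 bp and 177 bp product sizes are those stated above.

```python
NORMAL_PRODUCT_BP = 188    # product expected from the normal allele
DELETION_PRODUCT_BP = 177  # product expected from the nt821(del11) allele


def call_genotype(band_sizes_bp, tolerance_bp=2):
    """Call the nt821(del11) genotype from observed PCR fragment sizes.

    tolerance_bp is a hypothetical allowance for sizing error on the gel.
    """
    has_normal = any(abs(size - NORMAL_PRODUCT_BP) <= tolerance_bp for size in band_sizes_bp)
    has_deletion = any(abs(size - DELETION_PRODUCT_BP) <= tolerance_bp for size in band_sizes_bp)
    if has_normal and has_deletion:
        return "+/nt821(del11)"
    if has_normal:
        return "+/+"
    if has_deletion:
        return "nt821(del11)/nt821(del11)"
    return "no call"


for bands in ([188], [188, 177], [177], []):
    print(bands, "->", call_genotype(bands))
```

The tolerance must stay well below the 11 bp difference between the two products, otherwise a single band could be counted as both alleles.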

Example 5
Determination of mutations in other breeds

A total of 32 animals with extreme muscular development were sampled from
ten European beef cattle breeds in which double-muscled animals are known to occur at high to
moderate frequency: (i) Belgium: Belgian Blue (4), (ii) France: Blonde d'Aquitaine (5), Charolais
(2), Gasconne (2), Limousin (5), Maine-Anjou (4), Parthenaise (3), (iii) Spain: Asturiana (2),
Rubia Gallega (2), (iv) Italy: Piedmontese (2). The determination of the double-muscled
phenotype of the sampled animals was performed visually by experienced observers. Four
animals with conventional phenotype sampled from the Holstein-Friesian (2) and Jersey (2) dairy
populations were analysed as controls.

In order to facilitate the study of the myostatin coding sequence from genomic
DNA, the sequences of the exon-intron boundaries of the bovine gene were determined. In mice,
the myostatin gene is known to be interrupted by two introns, respectively approximately 1.5 and
2.4 kb long (McPherron & Lee, 1997). Two primer pairs were thus designed, respectively, in bovine
exons 1 and 2, and exons 2 and 3, that were predicted to flank the two introns, assuming
conservation of gene organisation between mouse and cattle (Figure 3 and Table 3). Using these
primer sets, two PCR products respectively 2 kb and 3.5 kb long were generated from a YAC clone
(179A3) containing the bovine myostatin gene (Grobet et al., 1997). The PCR products were purified
using QiaQuick PCR Purification kit (Qiagen) and partially sequenced using Dye terminator Cycle
Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic sequencer. Alignment
with the bovine cDNA sequence identified the four predicted exon-intron boundaries. The
nucleotide sequence corresponding to bovine genomic DNA is identified as SEQ ID NO:54.



WO 99/02667 PCT/IB98/01197 


Table 3: Primers used for PCR amplification and cycle sequencing.

Intron1-5'    5'-GAAGACGATGACTACCACGCCAGGACG-3'       Intron1-3'    5'-CTAGTTTATTGTATTGTATCTTAGAGC-3'
Intron2-5'    5'-AGACTCCTACAACAGTGTTTGT-3'            Intron2-3'    5'-ATACTCWAGGCCTAYAGCCTGTGGT-3'
Exon1-5'      5'-ATTCACTGGTGTGGCAAGTTGTCTCTCAGA-3'    Exon1-3'      5'-CCCTCCTCCTTACATACAAGCCAGCAG-3'
Exon2-5'      5'-GTTCATAGATTGATATGGAGGTGTTCG-3'       Exon2-3'      5'-ATAAGCACAGGAAACTGGTAGTTATT-3'
Exon3-5'      5'-GAAATGTGACATAAGCAAAATGATTAG-3'       Exon3-3'      5'-ATACTCWAGGCCTAYAGCCTGTGGT-3'
Exon1-Seq1    5'-TTGAGGATGTAGTGTTTTCC-3'              Exon1-Seq2    5'-GCCATAAAAATCCAAATCCTCAG-3'
Exon2-Seq1    5'-CATTTATAGCTGATCTTCTAACGCAAG-3'       Exon2-Seq2    5'-TGTCGCAGGAGTCTTGACAGGCCTCAG-3'
Exon2-Seq3    5'-GTACAAGGTATACTGGAATCCGATCTC-3'
Exon3-Seq1    5'-AGCAGGGGCCGGCTGAACCTCTGGG-3'         Exon3-Seq2    5'-CCCCAGAGGTTCAGCCGGCCCCTGC-3'


Based on the available exonic and intronic sequences of the bovine myostatin 
gene, three primer pairs that jointly allow for convenient amplification of the entire coding 
sequence from genomic DNA were designed. The position of the corresponding primers is 
shown in Figure 3, and the corresponding sequences are reported in Table 3. 

After PCR amplification of the entire coding sequence from genomic DNA in the
three described fragments, these were purified using QiaQuick PCR Purification kit (Qiagen) and
sequenced using Dye terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an
ABI373 automatic sequencer, using the primers used for amplification as well as a series of
nested primers (Figure 3 and Table 3). Chromat files produced with the ABI373 sequencer were
analysed with the Polyphred application (D. Nickerson, personal communication), which is part of
a series of sequence analysis programs including Phred (Ewing, B. & Green, P. (1992),
unpublished), Phrap (Green, P. (1994), unpublished) and Consed (Gordon, D. (1995),
unpublished), but any suitable sequence analysis program would do, as known to a person skilled in
the art.

Monoclonal antibodies (MAbs) specific for myostatin are useful. In the case of
the bovine protein having the amino acid sequence identified as SEQ ID NO:2, for example,
antibodies can be used for diagnostic purposes such as for determining myostatin protein levels
in muscle tissue. To produce these antibodies, purified myostatin is prepared. The myostatin
can be produced in bacterial cells as a fusion protein with glutathione-S-transferase using the
vector pGEX2 (Pharmacia). This permits purification of the fusion protein by GSH affinity
chromatography. In another approach, myostatin is expressed as a fusion protein with the
bacterial maltose binding domain. The fusion protein is thus recovered from bacterial extracts by
passing the extract over an amylose resin column followed by elution of the fusion protein with
maltose. For this fusion construct, the vector pMalC2, commercially available from New England
Biolabs, can be used. The preparation of a second fusion protein is also useful in the preliminary
screening of MAbs.

The generation of hybridomas expressing monoclonal antibodies recognizing
myostatin protein is carried out as follows: BALB/c mice are injected intraperitoneally with
protein/adjuvant three times at one-month intervals, followed by a final injection into the tail vein
shortly prior to cell fusion. Spleen cells are harvested and fused with NS-1 myeloma cells
(American Type Culture Collection, Rockville, MD) using polyethylene glycol 4000 according to
standard protocols (Kennett, 1979; Mirski, 1989). The cell fusion process is carried out as
described in more detail below.

The fused cells are plated into 96-well plates with peritoneal exudate cells and
irradiated spleen cells from BALB/c mice as feeder layers, and selection with hypoxanthine,
aminopterin, and thymidine (HAT medium) is performed.

An ELISA assay is used as an initial screening procedure. 1-10 µg of purified
myostatin (cleaved from the fusion protein) in PBS is used to coat individual wells, and 50-100 µl
per well of hybridoma supernatants is incubated. Horseradish peroxidase-conjugated anti-mouse
antibodies are used for the colorimetric assay.

Positive hybridomas are cloned by limiting-dilution and grown to large-scale for 
freezing and antibody production. Various positive hybridomas are selected for usefulness in 
western blotting and immunohistochemistry, as well as for cross reactivity with myostatin proteins 
from different species, for example the mouse and human proteins. 

Alternatively, active immunization by the generation of an endogenous antibody
by direct exposure of the host animal to small amounts of antigen can be carried out. Active
immunization involves the injection of minute quantities of antigen (µg) which probably will not
induce a physiological response and will be degraded rapidly. Antigen will only need to be
administered as prime and boost immunizations in much the same manner as techniques used
to confer disease resistance (Pell et al., 1997).

Antisense nucleic acids or oligonucleotides (RNA or preferably DNA) can be
used to inhibit myostatin production in order to increase muscle mass of an animal. Antisense
oligonucleotides, typically 15 to 20 bases long, bind to the sense mRNA or pre-mRNA region
coding for the protein of interest, which can inhibit translation of the bound mRNA to protein. The
cDNA sequence encoding myostatin can thus be used to design a series of oligonucleotides
which together span a large portion, or even the entire cDNA sequence. These oligonucleotides
can be tested to determine which provides the greatest inhibitory effect on the expression of the
protein (Stewart, 1996). The most suitable mRNA target sites include 5'- and 3'-untranslated
regions as well as the initiation codon. Other regions might be found to be more or less effective.
Alternatively, an antisense nucleic acid or oligonucleotide may bind to myostatin coding or
regulatory sequences.
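
The design of a series of oligonucleotides that together span a cDNA can be sketched as follows. This is only an illustration under stated assumptions: the helper names `reverse_complement` and `tile_antisense` are hypothetical, and the input is an arbitrary placeholder sequence, not SEQ ID NO:1. Each candidate antisense DNA oligonucleotide is simply the reverse complement of one window of the sense strand.

```python
# Hypothetical sketch: tile a sense-strand cDNA with antisense DNA oligonucleotides.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def tile_antisense(cdna, length=18, step=18):
    """Return (start, oligo) pairs; each oligo is antisense to cdna[start:start+length]."""
    if not 15 <= length <= 20:
        raise ValueError("the text describes oligonucleotides of 15 to 20 bases")
    cdna = cdna.upper()
    oligos = []
    for start in range(0, len(cdna) - length + 1, step):
        window = cdna[start:start + length]
        oligos.append((start, reverse_complement(window)))
    return oligos

# Arbitrary placeholder sequence (36 nt), used only to exercise the function.
placeholder_cdna = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGA"
candidates = tile_antisense(placeholder_cdna, length=18, step=18)
for start, oligo in candidates:
    print(start, oligo)
```

A `step` smaller than `length` yields overlapping oligonucleotides, which may be preferable when screening for the most effective target site.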

Rather than reducing myostatin activity by inhibiting myostatin gene expression
at the nucleic acid level, activity of the myostatin protein may be directly inhibited by binding to an
agent, such as, for example, a suitable small molecule or a monoclonal antibody.

It will of course be understood, without the intention of being limited thereby, that
a variety of substitutions of amino acids is possible while preserving the structure responsible for
myostatin activity of the proteins disclosed herein. Conservative substitutions are described in the
patent literature, as for example, in United States Patent No. 5,264,558 or 5,487,983. It is thus
expected, for example, that interchange among non-polar aliphatic neutral amino acids, glycine,
alanine, proline, valine and isoleucine, would be possible. Likewise, substitutions among the
polar aliphatic neutral amino acids, serine, threonine, methionine, asparagine and glutamine
could possibly be made. Substitutions among the charged acidic amino acids, aspartic acid and
glutamic acid, could probably be made, as could substitutions among the charged basic amino
acids, lysine and arginine. Substitutions among the aromatic amino acids, including
phenylalanine, histidine, tryptophan and tyrosine would also likely be possible. These sorts of
substitutions and interchanges are well known to those skilled in the art. Other substitutions
might well be possible. Of course, it would also be expected that the greater the percentage of
homology, i.e., sequence similarity, of a variant protein with a naturally occurring protein, the
greater the retention of metabolic activity. Of course, as protein variants having the activity of
myostatin as described herein are intended to be within the scope of this invention, so are
nucleic acids encoding such variants.
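
The substitution groups listed above can be written down as a simple lookup, for example to flag which differences between a reference protein and a variant fall within one group. The sketch below is illustrative only: the grouping is exactly the one given in the preceding paragraph (residues it does not mention, such as leucine and cysteine, are treated as ungrouped), and the function names and the short test peptides are hypothetical.

```python
# Substitution groups as listed in the text, using one-letter amino acid codes.
GROUPS = {
    "non-polar aliphatic neutral": set("GAPVI"),
    "polar aliphatic neutral": set("STMNQ"),
    "charged acidic": set("DE"),
    "charged basic": set("KR"),
    "aromatic": set("FHWY"),
}

def group_of(residue):
    """Name of the group containing the residue, or None if it is ungrouped."""
    for name, members in GROUPS.items():
        if residue.upper() in members:
            return name
    return None

def is_conservative(a, b):
    """True if a and b are identical or belong to the same listed group."""
    if a.upper() == b.upper():
        return True
    group = group_of(a)
    return group is not None and group == group_of(b)

def classify_substitutions(reference, variant):
    """List (position, ref, var, conservative) for each differing residue (1-based)."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    return [
        (i + 1, r, v, is_conservative(r, v))
        for i, (r, v) in enumerate(zip(reference, variant))
        if r != v
    ]

# Hypothetical four-residue peptides, used only to exercise the functions.
print(classify_substitutions("MKDE", "MRDA"))
```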

A further advantage may be obtained through chimeric forms of the protein, as
known in the art. A DNA sequence encoding the entire protein, or a portion of the protein, could
thus be linked, for example, with a sequence coding for the C-terminal portion of E. coli
β-galactosidase to produce a fusion protein. An expression system for human respiratory syncytial
virus glycoproteins F and G is described in United States Patent No. 5,288,630 issued February
22, 1994 and references cited therein, for example.

A recombinant expression vector of the invention can be a plasmid, as described
above. The recombinant expression vector of the invention further can be a virus, or portion
thereof, which allows for expression of a nucleic acid introduced into the viral nucleic acid. For
example, replication defective retroviruses, adenoviruses and adeno-associated viruses can be
used.

The recombinant expression vectors of the invention can be used to make a
transformant host cell including the recombinant expression vector. The term "transformant host
cell" is intended to include prokaryotic and eukaryotic cells which have been transformed or
transfected with a recombinant expression vector of the invention. The terms "transformed with",
"transfected with", "transformation" and "transfection" are intended to encompass introduction of
nucleic acid (e.g. a vector) into a cell by one of many possible techniques known in the art.
Prokaryotic cells can be transformed with nucleic acid by, for example, electroporation or
calcium-chloride mediated transformation. Nucleic acid can be introduced into mammalian cells
via conventional techniques such as calcium phosphate or calcium chloride coprecipitation,
DEAE-dextran-mediated transfection, lipofection, electroporation or microinjection. Suitable
methods for transforming and transfecting host cells are known (Sambrook, 1989).

The number of host cells transformed with a recombinant expression vector of
the invention by techniques such as those described above will depend upon the type of
recombinant expression vector used and the type of transformation technique used. Plasmid
vectors introduced into mammalian cells are integrated into host cell DNA at only a low
frequency. In order to identify these integrants, a gene that contains a selectable marker (e.g.
resistance to antibiotics) is generally introduced into the host cells along with the gene of interest.
Preferred selectable markers include those which confer resistance to certain drugs, such as
G418 and hygromycin. Selectable markers can be introduced on a separate plasmid from the
nucleic acid of interest or, preferably, are introduced on the same plasmid. Host cells
transformed with one or more recombinant expression vectors containing a nucleic acid of the
invention and a gene for a selectable marker can be identified by selecting for cells using the
selectable marker. For example, if the selectable marker is a gene conferring neomycin
resistance (as in pRc/CMV), transformant cells can be selected with G418. Cells that have
incorporated the selectable marker gene will survive, while the other cells die.

Nucleic acids which encode myostatin proteins can be used to generate
transgenic animals. A transgenic animal (e.g., a mouse) is an animal having cells that contain a
transgene, which transgene is introduced into the animal or an ancestor of the animal at a
prenatal, e.g., an embryonic, stage. A transgene is a DNA which is integrated into the genome of
a cell from which a transgenic animal develops. In one embodiment, a bovine cDNA, comprising
the nucleotide sequence shown in SEQ ID NO:1, or an appropriate variant or subsequence
thereof, can be used to generate transgenic animals that contain cells which express bovine
myostatin. Likewise, variants such as mutant genes (e.g. SEQ ID NO:3) can be used to generate
transgenic animals. This could equally well be done with the human myostatin protein and
variants thereof. "Knock out" animals, as described above, can also be generated. Methods for
generating transgenic animals, particularly animals such as mice, have become conventional in
the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. In a preferred
embodiment, plasmids containing recombinant molecules of the invention are microinjected into
mouse embryos. In particular, the plasmids are microinjected into the male pronuclei of fertilized
one-cell mouse eggs; the injected eggs are transferred to pseudo-pregnant foster females; and
the eggs in the foster females are allowed to develop to term (Hogan, 1986). Alternatively, an
embryonal stem cell line can be transfected with an expression vector comprising nucleic acid
encoding a myostatin protein, and cells containing the nucleic acid can be used to form
aggregation chimeras with embryos from a suitable recipient mouse strain. The chimeric
embryos can then be implanted into a suitable pseudopregnant female mouse of the appropriate
strain and the embryo brought to term. Progeny harboring the transfected DNA in their germ
cells can be used to breed uniformly transgenic mice.

Such animals could be used to determine whether a sequence related to an
intact myostatin gene retains biological activity of myostatin. Thus, for example, mice in which
the murine myostatin gene has been knocked out and containing the nucleic acid sequence
identified as SEQ ID NO:1 could be generated along with animals containing the nucleic acid
sequence identified as SEQ ID NO:3. The animals could be examined for display of muscular
hyperplasia, especially in comparison with knockout mice, which are known to display such. In
this way it can be shown that the protein encoded by SEQ ID NO:3 lacks myostatin activity within
the context of this invention while the protein encoded by the nucleic acid sequence identified as
SEQ ID NO:1 possesses biological activity of myostatin.

In such experiments, muscle cells would be particularly targeted for myostatin
(and variants) transgene incorporation by use of tissue specific enhancers operatively linked to
the encoding gene. For example, promoters and/or enhancers which direct expression of a
gene to which they are operatively linked preferentially in muscle cells can be used to create a
transgenic animal which expresses a myostatin protein preferentially in muscle tissue.
Transgenic animals that include a copy of a myostatin transgene introduced into the germ line of
the animal at an embryonic stage can also be used to examine the effect of increased myostatin
expression in various tissues.

Determination of the pattern and extent of expression of a recombinant molecule of the invention
in a transgenic mouse is facilitated by fusing a reporter gene to the recombinant molecule such
that both genes are co-transcribed to form a polycistronic mRNA. The reporter gene can be
introduced into the recombinant molecule using conventional methods such as those described
in Sambrook et al. (Sambrook, 1989). Efficient expression of both cistrons of the polycistronic
mRNA encoding the protein of the invention and the reporter protein can be achieved by
inclusion of a known internal translational initiation sequence such as that present in poliovirus
mRNA. The reporter gene should be under the control of the regulatory sequence of the
recombinant molecule of the invention and the pattern and extent of expression of the gene
encoding a protein of the invention can accordingly be determined by assaying for the phenotype
of the reporter gene. Preferably the reporter gene codes for a phenotype not displayed by the
host cell and the phenotype can be assayed quantitatively. Examples of suitable reporter genes
include lacZ (β-galactosidase), neo (neomycin phosphotransferase), CAT (chloramphenicol
acetyltransferase), dhfr (dihydrofolate reductase), aphIV (hygromycin phosphotransferase), lux
(luciferase) and uidA (β-glucuronidase). Preferably, the reporter gene is lacZ, which codes for
β-galactosidase. β-galactosidase can be assayed using the lactose analogue X-gal (5-bromo-4-
chloro-3-indolyl-β-D-galactopyranoside), which is broken down by β-galactosidase to a product
that is blue in color (Old).

The present invention includes knocking out wild type myostatin in mammals, in
order to obtain the desired effect(s) thereof. This is particularly true in cattle raised for beef
production. It may well prove advantageous to substitute a defective gene (e.g. SEQ ID NO:3 or
its genomic analogue) rather than delete the entire sequence of DNA encoding a protein
having myostatin activity. A method of producing a transgenic bovine or transgenic bovine
embryo is described in United States Patent No. 5,633,076, issued May 27, 1997, for example.

The transgenic animals of the invention can be used to investigate the molecular
basis of myostatin action. For example, it is expected that myostatin mutants in which one or
more of the conserved cysteine residues has been deleted would have diminished activity in
relation to a wild type myostatin protein in which all such residues are retained. Further, deletion
of the proteolytic cleavage site would likely result in a mutant lacking biological activity of myostatin.

Transgenesis can be used to inactivate myostatin activity. This could be
achieved using either conventional transgenesis, i.e. by injection in fertilized oocytes, or by gene
targeting methods using totipotent cell lines such as ES (embryonic stem) cells, which can then
be injected in oocytes and participate in the development of the resulting organisms, or whose
nucleus can be transferred into unfertilized oocytes (nuclear transfer or cloning).

It is also possible to create a genetically altered animal in which the double-muscling
trait is dominant so that the animal would be more useful in cross-breeding. Further, in
a particular aspect, the dominant trait would be male specific. In this way, bulls would be double-
muscled but cows would not be. In addition, or alternatively, the trait would also be unexpressed
until after birth or inducible. If inducible, the trait could be induced after birth to avoid the calving
difficulties described above.

There are at least three approaches that can be taken to create a dominant
"mh" allele. Because functional myostatin, a member of the TGF-β superfamily, is a dimer,
dominant negative myostatin mutations can be created (Herskowitz et al., 1987; Lopez et al.,
1992). According to one method, this is accomplished by mutating the proteolytic processing
site of myostatin. To enhance the dominant negative effect, the gene can be put under the
control of a stronger promoter such as the CMV promoter or that of a myosin gene, which is
tissue specific, i.e., expressed only in skeletal muscle. Alternatively, an antisense sequence of
that encoding myostatin could be incorporated into the DNA, so that complementary mRNA
molecules are generated, as understood by a person skilled in the art. Optionally, a ribozyme
could be added to enhance mRNA breakdown. In another approach, a Cre
recombinase/ribozyme approach or the Cre-loxP system could be used (Hoess et al., 1982; Gu et
al., 1994).

Male specificity can be achieved by placing the dominant mh alleles on the Y
chromosome by homologous recombination.

Inducibility can be achieved by choosing promoters with post-natal expression in
skeletal muscle or by using inducible systems such as the Tet-On and Tet-Off systems
(Gossen et al., 1992; Shockett et al., 1996).

Using conventional transgenesis, a gene coding for a myostatin antisense is
injected; such a gene can be obtained, for example, by inverting the orientation of the myostatin
gene in front of its natural promoter and enhancer sequences. This is followed by injection of a
gene coding for an anti-myostatin ribozyme, i.e. an RNA that would specifically bind to endogenous
myostatin mRNA and destroy it via its "ribozyme" activity.

Also, through gene targeting, a conventional knock-out animal can be
generated, and specific mutations can be engineered by gene replacement. It is possible to
inactivate the myostatin gene at a specific developmental time, such as after birth to avoid calving
difficulties. As mentioned above, this could be achieved using the Cre-loxP system, in which
loxP sites are engineered around the myostatin gene by homologous recombination (gene
targeting), and mating these animals with transgenic animals having a Cre transgene (coding for
the Cre recombinase, which excises DNA flanked by loxP sites) under the dependence of a skeletal
muscle specific promoter only active after birth. This is done to obtain individuals that would
inactivate their myostatin gene after birth. As mentioned above, there are also gene targeting
systems that allow genes to be turned on and off by feeding an animal with, for example, an
antibiotic. In such an instance, one engineers an operator between the promoter of the gene and
the gene itself. This operator is the target of a repressor which, when binding, inactivates the gene
(for example, the lac operon in E. coli). The repressor is brought into the cell using conventional
transgenesis, for example, by injection of the gene coding for the repressor.

Transgenic animals of the invention can also be used to test substances for the
ability to prevent, slow or enhance myostatin action. A transgenic animal can be treated with the
substance in parallel with an untreated control transgenic animal.

The antisense nucleic acids and oligonucleotides of the invention are useful for 
inhibiting expression of nucleic acids (e.g. mRNAs) encoding proteins having myostatin activity. 

The isolated nucleic acids and antisense nucleic acids of the invention can be
used to construct recombinant expression vectors as described previously. These recombinant
expression vectors are then useful for making transformant host cells containing the recombinant
expression vectors, for expressing protein encoded by the nucleic acids of the invention, and for
isolating proteins of the invention as described previously. The isolated nucleic acids and
antisense nucleic acids of the invention can also be used to construct transgenic and knockout
animals as described previously.

The isolated proteins of the invention are useful for making antibodies reactive
against proteins having myostatin activity, as described previously. Alternatively, the antibodies of
the invention can be used to isolate a protein of the invention by standard immunoaffinity
techniques. Furthermore, the antibodies of the invention, including bispecific antibodies, are
useful for diagnostic purposes.

Molecules which bind to a protein comprising an amino acid sequence shown in
SEQ ID NO:2 can also be used in a method for killing a cell which expresses the protein, wherein
the cell takes up the molecule, if for some reason this were desirable. Destruction of such cells
can be accomplished by labeling the molecule with a substance having toxic or therapeutic
activity. The term "substance having toxic or therapeutic activity" as used herein is intended to
include molecules whose action can destroy a cell, such as a radioactive isotope, a toxin (e.g.
diphtheria toxin or ricin), or a chemotherapeutic drug, as well as cells whose action can destroy a
cell, such as a cytotoxic cell. The molecule binding to the myostatin can be directly coupled to a
substance having a toxic or therapeutic activity or may be indirectly linked to the substance. In
one example, the toxicity of the molecule taken up by the cell is activated by myostatin protein.

The invention also provides a diagnostic kit for identifying cells, comprising a
molecule which binds to a protein comprising an amino acid sequence shown in SEQ ID NO:2,
for example, for incubation with a sample of tumor cells; means for detecting the molecule
bound to the protein, unreacted protein or unbound molecule; means for determining the amount
of protein in the sample; and means for comparing the amount of protein in the sample with a
standard. Preferably, the molecule is a monoclonal antibody. In some embodiments of the
invention, the detectability of the molecule which binds to myostatin is activated by said binding
(e.g., change in fluorescence spectrum, loss of radioisotopic label). The diagnostic kit can also
contain an instruction manual for use of the kit.

The invention further provides a diagnostic kit for identifying cells, comprising a
nucleotide probe complementary to the sequence, or an oligonucleotide fragment thereof,
shown in SEQ ID NO:1, for example, for hybridization with mRNA from a sample of cells, e.g.,
muscle cells; means for detecting the nucleotide probe bound to mRNA in the sample with a
standard. In a particular aspect, the invention is a probe having a nucleic acid molecule
sufficiently complementary with a sequence identified as SEQ ID NO:1, or its complement, so as
to bind thereto under stringent conditions. "Stringent hybridization conditions" takes on its
common meaning to a person skilled in the art here. Appropriate stringency conditions which
promote nucleic acid hybridization, for example, 6x sodium chloride/sodium citrate (SSC) at
about 45°C, are known to those skilled in the art. The following examples are found in Current
Protocols in Molecular Biology, John Wiley & Sons, NY (1989), 6.3.1-6.3.6: For 50 ml of a first
suitable hybridization solution, mix together 24 ml formamide, 12 ml 20x SSC, 0.5 ml 2 M
Tris-HCl pH 7.6, 0.5 ml 100x Denhardt's solution, 2.5 ml deionized H2O, 10 ml 50% dextran
sulfate, and 0.5 ml 10% SDS. A second suitable hybridization solution can be 1% crystalline
BSA (fraction V), 1 mM EDTA, 0.5 M Na2HPO4 pH 7.2, 7% SDS. The salt concentration in the
wash step can be selected from a low stringency of about 2x SSC at 50°C to a high stringency of
about 0.2x SSC at 50°C. Both of these wash solutions may contain 0.1% SDS. In addition, the
temperature in the wash step can be increased from low stringency conditions at room
temperature, about 22°C, to high stringency conditions, at about 65°C. The cited reference
gives more detail, but appropriate wash stringency depends on degree of homology and length
of probe. If homology is 100%, a high temperature (65°C to 75°C) may be used. If homology is
low, lower wash temperatures must be used. However, if the probe is very short (<100 bp), lower
temperatures must be used even with 100% homology. In general, one starts washing at low
temperatures (37°C to 40°C), and raises the temperature by 3-5°C intervals until background is
low enough not to be a major factor in autoradiography. The diagnostic kit can also contain an
instruction manual for use of the kit.
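
As a check on the first hybridization solution given above, the final concentration of each component follows from stock concentration x volume added / total volume. The short sketch below works through that arithmetic for the stated 50 ml recipe. It assumes the formamide stock is undiluted (100%), which the text does not state; the variable and function names are illustrative only.

```python
# Recipe from the text: (component, volume in ml, stock concentration, unit).
# Assumption: formamide is added undiluted, i.e. as a 100% (v/v) stock.
RECIPE = [
    ("formamide", 24.0, 100.0, "% v/v"),
    ("SSC", 12.0, 20.0, "x"),
    ("Tris-HCl pH 7.6", 0.5, 2000.0, "mM"),
    ("Denhardt's solution", 0.5, 100.0, "x"),
    ("deionized water", 2.5, None, ""),
    ("dextran sulfate", 10.0, 50.0, "%"),
    ("SDS", 0.5, 10.0, "%"),
]

def final_concentrations(recipe):
    """Return (total volume, {component: (final concentration, unit)})."""
    total_ml = sum(volume for _, volume, _, _ in recipe)
    finals = {}
    for name, volume, stock, unit in recipe:
        if stock is not None:  # water only contributes volume
            finals[name] = (stock * volume / total_ml, unit)
    return total_ml, finals

total_ml, finals = final_concentrations(RECIPE)
print(f"total volume: {total_ml} ml")
for name, (conc, unit) in finals.items():
    print(f"{name}: {conc:g} {unit}")
```

The listed volumes add up to the stated 50 ml, giving roughly 48% formamide, 4.8x SSC, 20 mM Tris-HCl, 1x Denhardt's solution, 10% dextran sulfate and 0.1% SDS.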

The invention provides a diagnostic kit which can be used to determine the
genotype of mammalian genetic material, for example. One kit includes a set of primers used
for amplifying the genetic material. A kit can contain a primer including a nucleotide sequence
for amplifying a region of the genetic material containing one of the naturally occurring mutations
described herein. Such a kit could also include a primer for amplifying the corresponding region
of the normal gene that produces functional myostatin. Usually, such a kit would also include
another primer upstream or downstream of the region of interest complementary to a coding
and/or non-coding portion of the gene. A particular kit includes a primer selected from a non-
coding sequence of a myostatin gene. Examples of such primers are provided in Table 3,
designated as Exon1-5', Exon1-3', Exon2-5', Exon3-5' and Exon3-3'. These primers are used to
amplify the segment containing the mutation of interest. The actual genotyping is carried out
using primers that target specific mutations described herein and that could function as allele-
specific oligonucleotides in conventional hybridization, Taqman assays, OLA assays, etc.
Alternatively, primers can be designed to permit genotyping by microsequencing.

One kit of primers thus includes first, second and third primers, (a), (b) and (c),
respectively. Primer (a) is based on a region containing a myostatin mutation, for example a
region of the myostatin gene spanning the nt821del(11) deletion. Primer (b) is based on a region
upstream or downstream of the region to be amplified by primer (a) so that genetic material
containing the mutation is amplified, by PCR, for example, in the presence of the two primers.
Primer (c) is based on the region corresponding to that on which primer (a) is based, but lacking
the mutation. Thus, genetic material containing the non-mutated region will be amplified in the
presence of primers (b) and (c). Genetic material homozygous for the wild type gene will thus
provide amplified products in the presence of primers (b) and (c). Genetic material homozygous
for the mutated gene will thus provide amplified products in the presence of primers (a) and (b).
Heterozygous genetic material will provide amplified products in both cases.
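
The interpretation of the two amplification reactions described above amounts to a small decision table, sketched below. The function name `call_genotype` and the "no call" outcome for a sample in which neither reaction yields a product are illustrative assumptions, not part of the described kit.

```python
def call_genotype(product_with_a_and_b, product_with_b_and_c):
    """Interpret the two allele-specific PCR reactions.

    product_with_a_and_b: True if primers (a)+(b) (mutation-specific) gave a product.
    product_with_b_and_c: True if primers (b)+(c) (wild-type-specific) gave a product.
    """
    if product_with_a_and_b and product_with_b_and_c:
        return "heterozygous"
    if product_with_a_and_b:
        return "homozygous mutant"
    if product_with_b_and_c:
        return "homozygous wild type"
    # Assumption: no product in either reaction is treated as a failed assay.
    return "no call"

for ab, bc in [(True, False), (False, True), (True, True), (False, False)]:
    print(ab, bc, "->", call_genotype(ab, bc))
```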

The invention provides purified proteins having biological activity of myostatin.
The terms "isolated" and "purified" each refer to a protein substantially free of cellular material or
culture medium when produced by recombinant DNA techniques, or chemical precursors or
other chemicals when chemically synthesized. In certain preferred embodiments, the protein
having biological activity of myostatin comprises an amino acid sequence identified as SEQ ID
NO:2. Furthermore, proteins having biological activity of myostatin that are encoded by nucleic
acids which hybridize under stringent conditions, as discussed above, to a nucleic acid
comprising a nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:7 are encompassed
by the invention. Proteins of the invention having myostatin activity can be obtained by expression
in a suitable host cell using techniques known in the art. Suitable host cells include prokaryotic
or eukaryotic organisms or cell lines, for example, yeast, E. coli, insect cells and COS 1 cells.
The recombinant expression vectors of the invention, described above, can be used to express a
protein having myostatin activity in a host cell in order to isolate the protein. The invention
provides a method of preparing a purified protein of the invention comprising introducing into a
host cell a recombinant nucleic acid encoding the protein, allowing the protein to be expressed in
the host cell and isolating and purifying the protein. Preferably, the recombinant nucleic acid is a
recombinant expression vector. Proteins can be isolated from a host cell expressing the protein
and purified according to standard procedures of the art, including ammonium sulfate
precipitation, column chromatography (e.g. ion exchange, gel filtration, affinity chromatography,
etc.), electrophoresis, and ultimately, crystallization (see generally, "Enzyme Purification and
Related Techniques", Methods in Enzymology, 22, 233-577 (1971)).

Alternatively, the protein or parts thereof can be prepared by chemical synthesis
using techniques well known in the chemistry of proteins such as solid phase synthesis
(Merrifield, 1964), or synthesis in homogeneous solution (Houbenweyl, 1987).

The protein of the invention, or portions thereof, can be used to prepare
antibodies specific for the proteins. Antibodies can be prepared which bind to a distinct epitope in
an unconserved region of a particular protein. An unconserved region of the protein is one which
does not have substantial sequence homology to other proteins, for example other members of
the myostatin family or other members of the TGFβ superfamily. Conventional methods can be
used to prepare the antibodies. For example, by using a peptide of a myostatin protein,
polyclonal antisera or monoclonal antibodies can be made using standard methods. A mammal
(e.g. a mouse, hamster, or rabbit) can be immunized with an immunogenic form of the peptide
which elicits an antibody response in the mammal. Techniques for conferring immunogenicity on
a peptide include conjugation to carriers or other techniques well known in the art. For example,
the peptide can be administered in the presence of adjuvant. The progress of immunization can
be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other
immunoassay can be used to assess the levels of antibodies. Following immunization, antisera
can be obtained and, if desired, polyclonal antibodies isolated from the sera.

To produce monoclonal antibodies, antibody producing cells (lymphocytes) can 
be harvested from an immunized animal and fused with myeloma cells by standard somatic cell 
fusion procedures, thus immortalizing these cells and yielding hybridoma cells. Such techniques 
are well known in the art. Examples include the hybridoma technique originally developed by Kohler 
and Milstein (Kohler, 1975) as well as other techniques such as the human B-cell hybridoma 
technique (Kozbor, 1983), the EBV-hybridoma technique to produce human monoclonal 
antibodies (Cole, 1985), and screening of combinatorial antibody libraries (Huse, 1989). 
Hybridoma cells can be screened immunochemically for production of antibodies specifically 
reactive with the peptide, and monoclonal antibodies isolated. 

The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with a protein having the biological activity of myostatin, or a peptide 
fragment thereof. Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above for whole antibodies. For 
example, F(ab')2 fragments can be generated by treating antibody with pepsin. The resulting 
F(ab')2 fragment can be treated to reduce disulfide bridges to produce Fab' fragments. 

It is also known in the art to make chimeric antibody molecules with human 
constant regions. See, for example, Morrison et al., Takeda et al., Cabilly et al., Boss et al., 
Tanaguchi et al., Teng et al. (Morrison, 1985; Takeda, 1985; Cabilly; Boss; Tanaguchi; Teng, 
1982), European Patent Publication 0173494, United Kingdom Patent GB 2177096B, PCT 
Publication WO92/06193 and EP 0239400. It is expected that such chimeric antibodies would be 
less immunogenic in a human subject than the corresponding non-chimeric antibody. 

Another method of generating specific antibodies, or antibody fragments, 
reactive against protein having the biological activity of a myostatin protein, or a peptide fragment 
thereof, is to screen expression libraries encoding immunoglobulin genes, or portions thereof, 
expressed in bacteria, with peptides produced from the nucleic acid molecules of the present 
invention. For example, complete Fab fragments, VH regions and FV regions can be expressed 
in bacteria using phage expression libraries. See for example Ward et al., Huse et al., and 
McCafferty et al. (Ward, 1989; Huse, 1989; McCafferty, 1990). Screening such libraries with, for 
example, a myostatin protein can identify immunoglobulin fragments reactive with myostatin. 
Alternatively, the SCID-hu mouse developed by Genpharm can be used to produce antibodies, 
or fragments thereof. 

The polyclonal, monoclonal or chimeric monoclonal antibodies can be used to 
detect the proteins of the invention, portions thereof or closely related isoforms in various 
biological materials, for example they can be used in an ELISA, radioimmunoassay or 
histochemical tests. Thus, the antibodies can be used to quantify the amount of a myostatin 
protein of the invention, portions thereof or closely related isoforms in a sample in order to 
determine the role of myostatin proteins in particular cellular events or pathological states. Using 
methods described hereinbefore, polyclonal, monoclonal antibodies, or chimeric monoclonal 
antibodies can be raised to nonconserved regions of myostatin and used to distinguish a 
particular myostatin from other proteins. 

The polyclonal or monoclonal antibodies can be coupled to a detectable 
substance or reporter system. The term "coupled" is used to mean that the detectable 
substance is physically linked to the antibody. Suitable detectable substances include various 
enzymes, prosthetic groups, fluorescent materials, luminescent materials and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 
β-galactosidase, and acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride and phycoerythrin; an example of a luminescent material includes 
luminol; and examples of suitable radioactive material include ¹²⁵I, ¹³¹I, ³⁵S and ³H. In a preferred 
embodiment, the reporter system allows quantitation of the amount of protein (antigen) present. 

Such an antibody-linked reporter system could be used in a method for 
determining whether a fluid or tissue sample of a subject contains a deficient amount or an 
excessive amount of the protein. Given a normal threshold concentration of such a protein for a 
given type of subject, test kits could thus be developed. 
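
As a purely illustrative sketch of the comparison such a test kit would perform, the following Python fragment classifies a quantified protein level against a normal range. The function name `classify_level`, the units and the range limits are hypothetical placeholders; the specification does not state any threshold values.

```python
def classify_level(measured: float, normal_low: float, normal_high: float) -> str:
    """Classify a measured protein concentration against an assumed normal range.

    All three values must use the same (arbitrary) concentration units.
    """
    if normal_low > normal_high:
        raise ValueError("normal_low must not exceed normal_high")
    if measured < normal_low:
        return "deficient"
    if measured > normal_high:
        return "excessive"
    return "normal"


# Hypothetical normal range for a given type of subject (arbitrary units).
NORMAL_LOW, NORMAL_HIGH = 10.0, 20.0

for value in (4.2, 15.0, 31.7):
    print(value, classify_level(value, NORMAL_LOW, NORMAL_HIGH))
```

In practice the normal range would have to be established empirically for each type of subject and each sample type.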

The present invention allows the skilled artisan to prepare bispecific antibodies 
and tetrameric antibody complexes. Bispecific antibodies can be prepared by forming hybrid 
hybridomas (Staerz, 1986a & b). 

Compositions of the invention are administered to subjects in a biologically 
compatible form suitable for pharmaceutical administration in vivo. By "biologically compatible 
form suitable for administration in vivo" is meant a form of the composition to be administered in 
which any toxic effects are outweighed by the therapeutic effects of the composition. The term 
"subject" is intended to include living organisms in which a desired therapeutic response can be 
elicited, e.g. mammals. Examples of subjects include cattle, human, dogs, cats, mice, rats and 
transgenic species thereof. Administration of a therapeutically active amount of the therapeutic 
compositions of the present invention is defined as an amount effective, at dosages and for 
periods of time necessary to achieve the desired result. For example, a therapeutically active 
amount of a compound that inhibits the biological activity of myostatin protein may vary according 
to factors such as the age, sex, and weight of the individual, as well as target tissue and mode of 
delivery. Dosage regimes may be adjusted to provide the optimum therapeutic response. For 
example, several divided doses may be administered daily or the dose may be proportionally 
reduced as indicated by the exigencies of the therapeutic situation. 

As far as the United States is concerned, this application is a Continuation-in-Part 
Application of prior United States Patent Application Serial No. 08/891,789, filed July 14, 
1997, the specification of which is incorporated herein by reference. 

Those skilled in the art will know, or be able to ascertain using no more 
than routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 

CLAIMS 

1. A method of increasing muscle mass of a mammal having muscle cells in which myostatin is 
expressed, the method comprising administering to the mammal an effective amount of a nucleic 
acid molecule substantially complementary to at least a portion of mRNA encoding the myostatin 
and being of sufficient length to sufficiently reduce expression of the myostatin to increase the 
muscle mass. 

2. The method of claim 1 wherein the mammal is bovine. 

3. A method of increasing muscle mass of a mammal, the method comprising administering to 
the mammal an effective amount of a nucleic acid molecule having ribozyme activity and a 
nucleotide sequence substantially complementary to at least a portion of mRNA encoding 
myostatin and being of sufficient length to bind selectively thereto to sufficiently reduce 
expression of the myostatin so as to increase the muscle mass. 

4. The method of claim 3 wherein the mammal is bovine. 

5. A diagnostic kit, for determining the presence of muscular hyperplasia in a mammal from 
which a sample containing DNA of the mammal has been obtained, the kit comprising: 

first and second primers for amplifying the DNA, the primers being complementary to 
nucleotide sequences of the DNA upstream and downstream, respectively, of a 
mutation in the portion of the DNA encoding myostatin which results in muscular 
hyperplasia of the mammal, wherein at least one of the nucleotide sequences is 
selected to be from a non-coding region of the myostatin gene. 

6. The diagnostic kit of claim 5, further comprising a third primer complementary to a naturally 
occurring mutation of a coding portion of the myostatin gene. 

7. A diagnostic kit, for determining the genotype of a sample of mammalian genetic material, the 
kit comprising: 

a pair of primers for amplifying a portion of the genetic material corresponding to a 
nucleotide sequence which encodes at least a portion of a myostatin protein, 
wherein a first of the primers includes a nucleotide sequence sufficiently 
complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic 
acid molecule containing the mutation, the mutation being selected from the group 
of mutations resulting from: (a) deletion of 11 nucleotides beginning at nucleotide 
821 of the coding portion of SEQ ID NO:1; (b) deletion of 7 nucleotides beginning at 
nucleotide 419 of the coding sequence and insertion of the sequence 
AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of the coding 
sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the 
coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 
313 of the coding sequence and insertion of A in place thereof; and combinations 
thereof. 

8. The diagnostic kit of claim 7 wherein a second of the pair of primers is located entirely 
upstream or entirely downstream of the selected mutation or mutations. 

9. The diagnostic kit of claim 8 wherein a first said primer spans mutation (a) and further 
comprising a third primer which is sufficiently complementary to the nucleotide sequence 
identified as SEQ ID NO:11 to prime amplification of a nucleic acid molecule containing SEQ ID 
NO:11. 

10. The diagnostic kit of claim 8 wherein a first said primer is sufficiently complementary to the 
inserted sequence of mutation (b) to prime amplification of a nucleic acid molecule containing 
mutation (b) and further comprising a third primer which is sufficiently complementary to the 
sequence corresponding to the 7 nucleotide deletion of mutation (b) to prime amplification of a 
nucleic acid molecule containing the 7 nucleotide deletion of mutation (b). 

11. The diagnostic kit of claim 8 wherein a first said primer spans mutation (c) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (c) to prime amplification of a nucleic acid molecule 
lacking mutation (c). 

12. The diagnostic kit of claim 8 wherein a first said primer spans mutation (d) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (d) to prime amplification of a nucleic acid molecule 
lacking mutation (d). 

13. The diagnostic kit of claim 8 wherein a first said primer spans mutation (e) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (e) to prime amplification of a nucleic acid molecule 
lacking mutation (e). 

14. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 

obtaining a sample of material containing DNA from a said animal; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
wherein the absence of DNA having said nucleotide sequence indicates the presence of 
muscular hyperplasia in the animal. 

15. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the DNA in the 
presence of primers based on a nucleotide sequence encoding a protein having biological activity 
of myostatin. 

16. The method of claim 15 wherein DNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 

17. The method of claim 14, wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of primers based on a nucleotide sequence encoding the N-terminal and the 
C-terminal, respectively, of the protein having biological activity of myostatin. 

18. The method of claim 14, wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of first and second primers based on first and second nucleotide sequences 
encoding spaced apart regions of the protein, wherein said regions flank a mutation known to 
naturally occur and which when present in both alleles of a said animal results in said muscular 
hyperplasia. 

19. The method of claim 18 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
11-base pair deletion beginning at base pair no. 821, and said first primer is selected to be upstream 
of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no. 274. 

20. The method of claim 14 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
11-base pair deletion beginning at base pair no. 821, and said primer is selected to span the 
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing said 
deletion. 

21. The method of claim 19 wherein the animal is of a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

22. The method of claim 20 wherein the animal is a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

23. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of a primer containing at least a portion of a mutation known to naturally occur 
and which when present in both alleles of a said animal results in said muscular hyperplasia. 

24. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 

obtaining a sample of material containing DNA from a said animal; and 
ascertaining whether DNA having a mutation as defined in claim 7 is present; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
wherein the absence of DNA having said nucleotide sequence and presence of a said mutation 
indicates the presence of muscular hyperplasia in the animal. 

25. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 

obtaining a sample of the animal containing mRNA; and 
ascertaining whether an mRNA encoding a protein having biological activity of myostatin 
is present in the sample, 
wherein the absence of said mRNA indicates the presence of muscular hyperplasia in the 
animal. 

26. The method of claim 25 wherein the sample is of muscle tissue or wherein the tissue is 
skeletal muscle tissue. 

27. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the mRNA in the 
presence of primers substantially complementary to the nucleotide sequence encoding the 
protein. 

28. The method of claim 27 wherein mRNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 

29. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of primers substantially complementary to a nucleotide sequence encoding the 
N-terminal and the C-terminal, respectively, of the protein having biological activity of myostatin. 

30. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of first and second primers substantially complementary to first and second 
nucleotide sequences encoding spaced apart regions of the protein, wherein said regions flank a 
mutation known to naturally occur and which when present in both alleles of a said animal results 
in said muscular hyperplasia. 

31. The method of claim 30 wherein an mRNA of said animal not displaying muscular 
hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 
nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
11-base pair deletion beginning at base pair no. 821, and said first primer is selected to be 
upstream of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no. 274. 

32. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of a primer containing a nucleotide sequence complementary to at least a 
portion of a mutation known to naturally occur in a said animal and which when present in both 
alleles of a said animal results in said muscular hyperplasia. 

33. The method of claim 32 wherein an mRNA of said animal not displaying muscular 
hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 
nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
11-base pair deletion beginning at base pair no. 821, and said primer is selected to span the 
deleted portion. 

34. The method of claim 31 wherein the animal is of a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

35. A method for determining the presence of muscular hyperplasia in a mammal, the method 
comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity 
of myostatin, is present, and whether a sequence of the DNA encoding (b) an allelic 
protein lacking the activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 
hyperplasia in the mammal. 

36. The method of claim 35 wherein (b) contains a naturally occurring mutation responsible for 
the lack of activity. 

37. The method of claim 35 wherein the mammal is a human. 

38. The method of claim 37 wherein ascertaining whether a sequence of the DNA encoding (a) 
is present, and whether a sequence of the DNA encoding (b) is present includes amplifying the 
DNA in the presence of primers based on a nucleotide sequence encoding a protein having biological 
activity of myostatin. 

39. The method of claim 38 wherein said primers are based on the sequence identified as SEQ 
ID NO:7. 

40. A method for determining the presence of muscular hyperplasia in a mammal, the method 
comprising: 

obtaining a sample of material containing mRNA from the mammal; and 
ascertaining whether a sequence of the mRNA encoding (a) a protein having biological 
activity of myostatin, is present, and whether a sequence of the mRNA encoding 
(b) a protein at least partially encoded by a truncated nucleotide sequence 
corresponding to substantially the sequence of the mRNA and lacking the 
activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 
hyperplasia in the mammal. 

41. The method of claim 40 wherein the mRNA encoding (a) and the truncated sequence 
correspond to alleles of DNA of the mammal. 

42. The method of claim 40 wherein the mammal is human. 

43. The method of claim 42 wherein ascertaining whether a sequence of the mRNA encoding 
(a) is present, and whether a sequence of the mRNA encoding (b) is present includes amplifying 
the mRNA in the presence of a pair of primers complementary to a nucleotide sequence 
encoding a protein having biological activity of myostatin. 

44. The method of claim 43 wherein each said primer contains a truncated nucleotide sequence 
substantially complementary to a portion of the sequence identified as SEQ ID NO:7. 

45. The method of claim 44 wherein the truncated sequence contains at least 50 consecutive 
nucleotides substantially corresponding to about 10, or between about 10 and 20, or between 
about 20 and 30, or between about 30 and 40, or between about 40 and 50 consecutive 
nucleotides of SEQ ID NO:7. 

46. A method for determining the presence of muscular hyperplasia in a mammal, the method 
comprising: 

obtaining a tissue sample containing mRNA of the mammal; and 
ascertaining whether an mRNA encoding a mutant type myostatin protein lacking 
biological activity of myostatin is present, 
wherein the presence of a said mRNA encoding a mutant type myostatin protein indicates the 
presence of muscular hyperplasia in the mammal. 

47. The method of claim 46 wherein the mutant type myostatin protein lacking biological activity is 
encoded by a naturally occurring allele of DNA encoding the mRNA. 

48. A method for determining the presence of double muscling in a bovine animal, the method 
comprising: 

obtaining a sample of material containing DNA from the animal; and 
ascertaining whether the DNA contains the nucleotide coding sequence identified as 
SEQ ID NO:11, 
wherein absence of the sequence indicates double muscling in the animal. 

49. The method of claim 34 wherein the animal is of a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

50. A method for determining the myostatin genotype of a mammal, comprising: 

obtaining a sample of material containing nucleic acid of the mammal, wherein the nucleic 
acid is uncontaminated by heterologous nucleic acid; 
ascertaining whether the sample contains a (i) nucleic acid molecule encoding a protein 
having biological activity of myostatin; and 
ascertaining whether the sample contains an (ii) allelic nucleic acid molecule encoding a 
protein lacking biological activity of myostatin. 

51. The method of claim 50 wherein the mammal is human and (i) comprises a nucleic acid 
sequence substantially homologous with the sequence identified as SEQ ID NO:7. 

52. A purified protein having biological activity of myostatin, and having an amino acid sequence 
identified as SEQ ID NO:2, or a conservatively substituted variant thereof. 

53. An isolated nucleic acid molecule encoding a protein of claim 52. 

54. An isolated nucleic acid molecule comprising a DNA molecule having the nucleotide 
sequence identified as SEQ ID NO:1 or which varies from the sequence due to the degeneracy 
of the genetic code, or a nucleic acid strand capable of hybridizing with at least one said nucleic 
acid molecule under stringent hybridization conditions. 

55. Isolated mRNA transcribed from DNA having a sequence which corresponds to a nucleic 
acid molecule according to claim 54. 

56. Isolated DNA having a sequence according to claim 54 in a recombinant cloning vector. 

57. A microbial cell containing and expressing heterologous DNA which is complementary to a 
nucleic acid molecule of claim 54. 

58. A transfected cell line which expresses a protein of claim 52. 

59. A process for producing the protein of claim 52 comprising: 

preparing a DNA fragment including a nucleotide sequence which encodes said protein; 
incorporating the DNA fragment into an expression vector to obtain a recombinant DNA 
molecule which includes the DNA fragment and is capable of undergoing 
replication; 
transforming a host cell with said recombinant DNA molecule to produce a transformant 
which can express said protein; 
culturing the transformant to produce said protein; and 
recovering said protein from resulting cultured mixture. 

60. A method of increasing muscle mass in a mammal, comprising administering an effective 
amount of an antibody to myostatin to said mammal. 

61. A method of increasing muscle mass in a mammal, comprising raising an autoantibody to 
the myostatin in the mammal. 

62. The method of claim 61 wherein raising the autoantibody includes administering a protein 
having myostatin activity to the mammal. 

63. A method of increasing muscle mass in a mammal in need thereof, comprising 
administering to the mammal an effective amount of an antisense nucleic acid or oligonucleotide 
substantially complementary to at least a portion of the sequence identified as SEQ ID NO:1 or 
SEQ ID NO:5, or SEQ ID NO:7. 

64. The method of claim 63 wherein the portion is at least 5 nucleotide bases in length. 

65. The method of claim 64 wherein the mammal is a bovine and the sequence is the sequence 
identified as SEQ ID NO:1. 

66. A method of increasing muscle mass in a mammal, comprising administering to the 
mammal an effective amount of an antibody to the myostatin. 

67. A probe comprising a nucleic acid molecule sufficiently complementary with a sequence 
identified as SEQ ID NO:1, or its complement, so as to bind thereto under stringent conditions. 

68. The probe of claim 67 wherein the sequence is between about 8 and about 1195 
nucleotides in length, or between about 15 and 1195 nucleotides in length, or between about 25 
and 1195 nucleotides in length, or between about 35 and 1195 nucleotides in length, or between 
about 45 and 1195 nucleotides in length, or between about 55 and 1195 nucleotides in length, or 
between about 65 and 1195 nucleotides in length, or between about 75 and 1195 nucleotides in 
length, or between about 75 and 1195 nucleotides in length, or between about 85 and 1195 
nucleotides in length, or between about 95 and 1195 nucleotides in length, or between about 105 
and 1195 nucleotides in length, or between about 115 and 1195 nucleotides in length. 

69. A method for identifying a nucleotide sequence of a mutant gene encoding a myostatin 
protein of a mammal displaying muscular hyperplasia, the method comprising: 

obtaining a sample of material containing DNA from the mammal; and 
probing the sample using a nucleic acid probe based on a nucleotide sequence of a 
known gene encoding myostatin in order to identify nucleotide sequence of the 
mutant gene. 

70. The method of claim 69, wherein the probe is based on a nucleotide sequence of a 
non-coding region of the gene. 

71. The method of claim 70 wherein the probe is based on SEQ ID NO:54. 

72. The method of claim 71 wherein the probe is at least 8 nucleic acids in length. 

73. The method of claim 69, wherein the step of probing the sample includes exposing the DNA 
to the probe under hybridizing conditions and further comprising isolating hybridized nucleic acid 
molecules. 

74. The method of claim 73, further comprising the step of sequencing isolated DNA. 

75. The method of claim 69, wherein the mammal is a bovine mammal and the probe is based 
on a said nucleotide sequence identified as SEQ ID NO:1. 

76. The method of claim 74, further comprising the step of isolating and sequencing a cDNA or 
mRNA encoding the complete mutant myostatin protein. 

77. The method of claim 71, further comprising the step of isolating and sequencing a functional 
wild type myostatin from a said mammal not displaying muscular hyperplasia. 

78. The method of claim 76, further comprising comparing the complete coding sequence of the 
complete mutant myostatin protein with, if the coding sequence for a functional wild type 
myostatin from a said mammal is previously known, (1) the known sequence, or if the coding 
sequence for a functional wild type myostatin from a said mammal is previously unknown, (2) the 
sequence determined according to claim 74 or claim 77, to determine the location of any 
mutation in the mutant gene. 

79. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mammal is substantially that of claim 78, comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether the DNA contains a said mutation determined according to claim 
78. 

80. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mammal is substantially that of claim 78, comprising: 

obtaining a sample of material containing mRNA from the mammal; and 
ascertaining whether the mRNA contains a said mutation determined according to claim 
78. 

81. A primer composition useful for the detection of a nucleotide sequence encoding a myostatin 
comprising a first nucleic acid molecule based on a nucleotide sequence located upstream of a 
said mutation determined according to claim 78 and a second nucleic acid molecule based on a 
nucleotide sequence located downstream of the mutation. 

82. A probe comprising a nucleic acid molecule based on a nucleotide sequence of claim 74 or 
claim 76 and spanning a said mutation determined according to claim 78. 

83. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said 
phenotype being conferred by a transgene contained in the somatic and germ cells of the 
mammal, the transgene encoding a myostatin protein having a dominant negative mutation. 

84. The transgenic mammal of claim 83 wherein the mammal is male and non-human and the 
transgene is located on the Y chromosome. 

85. The transgenic mammal of claim 83 wherein the mammal is bovine and the transgene is 
located to be under the control of a promoter which is normally a promoter of a myosin gene. 

86. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said
phenotype being conferred by a transgene having a sequence antisense to that encoding a
myostatin protein of the mammal.

87. The transgenic mammal of claim 86 wherein the mammal is bovine and the transgene is 
located on the Y chromosome. 

88. The transgenic mammal of claim 86 wherein the transgene further comprises a sequence
which, when transcribed, yields an mRNA having ribozyme activity.

89. A transgenic non-human mammal having a phenotype characterized by muscular
hyperplasia, said phenotype being inducible and being conferred by a myostatin gene flanked by
loxP sites and a Cre transgene under the control of an inducible promoter.

90. A transgenic non-human male mammal having a phenotype characterized by muscular
hyperplasia, said phenotype being conferred by a myostatin gene flanked by loxP sites and a
Cre transgene located on the Y chromosome.

91. A method for determining whether a sample of mammalian genetic material is capable of
conferring a phenotype characterized by muscular hyperplasia, comprising ascertaining whether
the genetic material contains a nucleotide sequence encoding a protein having biological activity
of myostatin, wherein the absence of said sequence indicates the presence of muscular
hyperplasia in the animal.
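
One indirect way to ask whether a coding sequence can still encode a full-length protein is to translate it and look for a premature stop codon. The sketch below does this for two fragments that begin at position 853 of SEQ ID NO: 1 and SEQ ID NO: 3 as printed in this document, a position that is in frame with the ATG at position 46 of SEQ ID NO: 1. It is an illustration only: the function name `translate` is hypothetical, the standard genetic code is assumed, and a truncated reading frame is merely suggestive of lost activity rather than proof of it.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_sequence: str) -> str:
    """Translate in frame from the first base; stop after the first stop codon,
    which is reported as '*'."""
    protein = []
    for i in range(0, len(coding_sequence) - 2, 3):
        amino_acid = CODON_TABLE[coding_sequence[i:i + 3]]
        protein.append(amino_acid)
        if amino_acid == "*":
            break
    return "".join(protein)


wild_type = "CTTGATTGTGATGAACACTCCACAGAATCTCGATGCTGTCGTTACCCT"
mutant = "CTTGATTGTGACAGAATCTCGATGCTGTCGTTACCCTCTAACTGTGGATTTTGA"
print(translate(wild_type))
print(translate(mutant))
```

In this fragment the 11-base deletion shifts the reading frame, so the mutant translation diverges after the first few residues and ends at a stop codon shortly afterwards.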

92. A transgenic bovine having a genome lacking a gene encoding a protein having biological
activity of myostatin.

93. A transgenic mouse having a genome containing a gene encoding a human protein having
biological activity of myostatin or containing a gene encoding a bovine protein having biological 
activity of myostatin. 

94. A transgenic bovine having a gene encoding a bovine protein having biological activity of
myostatin and a heterologous nucleotide sequence antisense to the gene.

95. A transgenic bovine of claim 94, further comprising a gene encoding a nucleic acid 
sequence having ribozyme activity and in transcriptional association with the nucleotide sequence 
antisense to the gene. 
SEQUENCE ID NO. 1

1 AGGAAGAATA AGAACAAGGG AAAAGATTGT ATTGATTTTA AAACCATGCA
51 AAAACTGCAA ATCTCTGTTT ATATTTACCT ATTTATGCTG ATTGTTGCTG
101 GCCCAGTGGA TCTGAATGAG AACAGCGAGC AGAAGGAAAA TGTGGAAAAA
151 GAGGGGCTGT GTAATGCATG TTTGTGGAGG GAAAACACTA CATCCTCAAG
201 ACTAGAAGCC ATAAAAATCC AAATCCTCAG TAAACTTCGC CTGGAAACAG
251 CTCCTAACAT CAGCAAAGAT GCTATCAGAC AACTTTTGCC CAAGGCTCCT
301 CCACTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAG ATGCCAGCAG
351 TGACGGCTCC TTGGAAGACG ATGACTACCA CGCCAGGACG GAAACGGTCA
401 TTACCATGCC CACGGAGTCT GATCTTCTAA CGCAAGTGGA AGGAAAACCC
451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT
501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCC TGTCAAGACT CCTGCGACAG
551 TGTTTGTGCA AATCCTGAGA CTCATCAAAC CCATGAAAGA CGGTACAAGG
601 TATACTGGAA TCCGATCTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT
651 TTGGCAGAGC ATTGATGTGA AGACAGTGTT GCAGAACTGG CTCAAACAAC
701 CTGAATCCAA CTTAGGCATT GAAATCAAAG CTTTAGATGA GAATGGCCAT
751 GATCTTGCTG TAACCTTCCC AGAACCAGGA GAAGATGGAC TGACTCCTTT
801 TTTAGAAGTC AAGGTAACAG ACACACCAAA AAGATCTAGG AGAGATTTTG
851 GGCTTGATTG TGATGAACAC TCCACAGAAT CTCGATGCTG TCGTTACCCT
901 CTAACTGTGG ATTTTGAAGC TTTTGGATGG GATTGGATTA TTGCACCTAA
951 AAGATATAAG GCCAATTACT GCTCTGGAGA ATGTGAATTT GTATTTTTGC
1001 AAAAGTATCC TCATACCCAT CTTGTGCACC AAGCAAACCC CAGAGGTTCA
1051 GCCGGCCCCT GCTGTACTCC TACAAAGATG TCTCCAATTA ATATGCTATA
1101 TTTTAATGGC GAAGGACAAA TAATATACGG GAAGATTCCA GCCATGGTAG
1151 TAGATCGCTG TGGGTGTTCA TGAGTCTATA TT ZGG gTTC A TAAGC 
SEQUENCE ID NO. 2



1 MQKLQISVY IYLFMLIVAG PVDLNENSEQ KENVEKEGLC NACLWRENTT
51 SSRLEAIKIQ ILSKLRLETA PNISKDAIRQ LLPKAPPLLE LIDQFDVQRD
101 ASSDGSLEDD DYHARTETVI TMPTESDLLT QVEGKPKCCF FKFSSKIQYN
151 KLVKAQLWIY LRPVKTPATV FVQILRLIKP MKDGTRYTGI RSLKLDMNPG
201 TGIWQSIDVK TVLQNWLKQP ESNLGIEIKA LDENGHDLAV TFPEPGEDGL
251 TPFLEVKVTD TPKRSRRDFG LDCDEHSTES RCCRYPLTVD FEAFGWDWII
301 APKRYKANYC SGECEFVFLQ KYPHTHLVHQ ANPRGSAGPC CTPTKMSPIN
351 MLYFNGEGQI IYGKIPAMVV DRCGCS*
SEQUENCE ID NO. 3



1 AGGAAGAATA AGAACAAGGG AAAAGATTGT ATTGATTTTA AAACCATGCA
51 AAAACTGCAA ATCTCTGTTT ATATTTACCT ATTTATGCTG ATTGTTGCTG
101 GCCCAGTGGA TCTGAATGAG AACAGCGAGC AGAAGGAAAA TGTGGAAAAA
151 GAGGGGCTGT GTAATGCATG TTTGTGGAGG GAAAACACTA CATCCTCAAG
201 ACTAGAAGCC ATAAAAATCC AAATCCTCAG TAAACTTCGC CTGGAAACAG
251 CTCCTAACAT CAGCAAAGAT GCTATCAGAC AACTTTTGCC CAAGGCTCCT
301 CCACTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAG ATGCCAGCAG
351 TGACGGCTCC TTGGAAGACG ATGACTACCA CGCCAGGACG GAAACGGTCA
401 TTACCATGCC CACGGAGTCT GATCTTCTAA CGCAAGTGGA AGGAAAACCC
451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT
501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCC TGTCAAGACT CCTGCGACAG
551 TGTTTGTGCA AATCCTGAGA CTCATCAAAC CCATGAAAGA CGGTACAAGG
601 TATACTGGAA TCCGATCTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT
651 TTGGCAGAGC ATTGATGTGA AGACAGTGTT GCAGAACTGG CTCAAACAAC
701 CTGAATCCAA CTTAGGCATT GAAATCAAAG CTTTAGATGA GAATGGCCAT
751 GATCTTGCTG TAACCTTCCC AGAACCAGGA GAAGATGGAC TGACTCCTTT
801 TTTAGAAGTC AAGGTAACAG ACACACCAAA AAGATCTAGG AGAGATTTTG
851 GGCTTGATTG TGACAGAATC TCGATGCTGT CGTTACCCTC TAACTGTGGA
901 TTTTGAAGCT TTTGGATGGG ATTGGATTAT TGCACCTAAA AGATATAAGG
951 CCAATTACTG CTCTGGAGAA TGTGAATTTG TATTTTTGCA AAAGTATCCT
1001 CATACCCATC TTGTGCACCA AGCAAACCCC AGAGGTTCAG CCGGCCCCTG
1051 CTGTACTCCT ACAAAGATGT CTCCAATTAA TATGCTATAT TTTAATGGCG
1101 AAGGACAAAT AATATACGGG AAGATTCCAG CCATGGTAGT AGATCGCTGT
1151 GGGTGTTCAT GAGGTCTATA TTTGGTTCAT AGCTTCCTCA AACATGGAAG
1201 GTCTTCCCCT CAACAATTTT GAAACTGTTG AAATTATGT
SEQUENCE ID NO. 4



1 MQKLQISVY IYLFMLIVAG PVDLNENSEQ KENVEKEGLC NACLWRENTT
51 SSRLEAIKIQ ILSKLRLETA PNISKDAIRQ LLPKAPPLLE LIDQFDVQRD
101 ASSDGSLEDD DYHARTETVI TMPTESDLLT QVEGKPKCCF FKFSSKIQYN
151 KLVKAQLWIY LRPVKTPATV FVQILRLIKP MKDGTRYTGI RSLKLDMNPG
201 TGIWQSIDVK TVLQNWLKQP ESNLGIEIKA LDENGHDLAV TFPEPGEDGL
251 TPFLEVKVTD TPKRSRRDFG LDCDRISMLS LPSNCGF
SEQUENCE ID NO. 5



PCT/IB98/01197 



1 GTCTCTCGGA CGGTACATGC ACTAATATTT CACTTGGCAT TACTCAAAAG 

51 CAAAAAGAAG AAATAAGAAC AAGGGAAAAA AAAAGATTGT GCTGATTTTT

101 AAAATGATGC AAAAACTGCA AATGTATGTT TATATTTACC TCTTCATGCT 

151 GATTGCTGCT GGCCCAGTGG ATCTAAATGA GGGCAGTGAG AGAGAAGAAA 

201 ATGTGGAAAA AGAGGGGCTG TGTAATGCAT GTGCGTGGAG ACAAAACACG 

251 AGGTACTCCA GAATAGAACC CATAAAAATT CAAATCCTCA GTAAGCTGCG 

301 CCTGGAAACA GCTCCTAACA TCAGCAAAGA TGCTATAAGA CAACTVCTGC

351 CAAGACCGCC TCCACTCCCG GAACTGATCG ATCAGTACGA CGTCCAGAGG 

401 GATGACAGCA GTGATGGCTC TTTGGAAGAT GACGATTATC ACGCTACCAC 

451 GGAAACAATC ATTACCATGC CTACAGAGTC TGAGTTTCTA ATGCAAGCGG 

501 ATGGCAAGCC CAAATGTTGC TTTTTTAAAT TTAGCTCTAA AATACAGTAC 

551 AACAAACTAG TAAAAGCCCA ACTGTGGATA TATCTCAGAC CCGTCAAGAC 

601 TCCTACAACA GTQTTTCTGC AAAT CCTGAG ACTCATCAAA CCCATGAAAG 

651 JiCC CTACAAG GTATACTGGA ATCCGATCTC TOAAACTTGA CATGAGCCCA 

701 GGCACTGGTA TTTGGCAGAG TATTGATGTG AAGACAGTGl' TGCAAAATTG 

751 G CTCAAACAG CXTGAATCCA ACTTAGGCAT TGAAATCAAA GCTTTGGATG 

801 AGAATGGCCA TGATCTTCCT GTAACCTTCC CAGGACCAGG AGAAGATGGG 

851 CTGAATCCCT TTTTAGAACT CAAGGTGACA GACACACCCA AG AGO* l'CC C G 

901 GAGAGACTTT GGGCTTGACT CCGATGAGCA CTCCACGGAA TCCCGGTGCT 

951 GCCGCTACCC CCTCACGGTC GATTTTGAAG CCTTTGGATG GGACTGGATT 

1001 ATCGCACCCA AAAGATATAA GGCCAATTAC TGCTCAGGAG AGTGTGAATT 

1051 TGTGTTTTTA CAAAAATATC CGCATACTCA TCTTGTGCAC CAAGCAAACC 

1101 CCAGAGGCTC AGCAGGCCCT TGCTGCACTC CGACAAAAAT GTCTCCCAT*T 

1151 /^TATGCTAT ATTTTAATGG CAAAGAACAA ATAATAT AT G GGAAAATTCC 

1201 AGCCATGGTA GTAGACCGCX CTCGGTGOTC ATGAGCTTTG CATTAGGTTA 

1251 GAAACTTCCC AAGTCATGGA AGGTCTTCCC CTCAATTTCG AAACTGTGAA 
1301 TTCAAGOACC ACAGGCTGTA GGCCTTGAGT ATGCTCTACT AACGTAAGCA
1351 CAAGCTACAG TGTATGAACT AAAAGAGAGA ATAGATGCAA TGGTTGGCAT 
1401 TGAACCACCA AAATAAACCA TACTATAGGA TGTTGTATGA TTTCCAGAGT 
1451 TTTTGAAATA GATGGAGATC AAATTACATT TATGTCCATA TATCITATATT 
1501 ACAACTACAA TCTACCCAAG GAAGTGAGAG CACATCTTGT GGTCTGCTUA 
1551 GTTAGGAGGG TATGATTAAA AGGTAAAGTC TTATTTCCTA ACAGTTTCAC 
1601 TTAATATTTA CAGAACAATC TATATGTAGC CTTTGTAAAG TGTAGGATTG 
1651 TTATCATTTA AAAACATCAT GTACACTTAT ATTTGTATTG T ATAC1TG GT 

1701 AAGATAAAAT TCCACAAAGT AGGAATGGGG CCTCACATAC ACATTGCCAT
1751 T C C T ATT ATA AT TGGACAAT CCACCACGGT GCTAATGCAG TGCTCAATGG 

1801 CTCCTACTGG ACCTCTCGAT AGAACACTCT ACAAAGTACG AGTCTCTCTC

1851 TCCCTTCCAG GTGCATCTCC ACACACACAG CACTAAGTGT TCAATGCATT 

1901 TTCTTTAAGG AAAGAAGAAT CTTTTTTTCT AGAGGTCAAC TTTCAGTCAA 

1951 CTCTAGCACA GCGGGAGTGA CTG C TGCATC TTAAAAGGCA GCCAAACAGT 

2001 ATTCATTTTT TAATCTAAAT TTCAAAATCA CTGTCTGCCT TTATCACATG 

2051 CCAATTTTGT GGTAAAATAA TGGAAATGAC TGGTTCTATC AATATTGTAT

2101 AAAAGACTCT GAAACAATTA CATTTATATA AT AT GT AT AC AATATTGTTT 

2151 TGTAAATAAG TGTCTCCTTT TATATTTACT TTGGTATATT TTTACACTAA 

2201 TGAAATTTCA AATCATTAAA GTACAAAGAC ATGTCATGTA TCACAAAAAA

2251 GGTGACTGCT TCTATTTCAG AGTGAATTAG CAGATTCAAT AGTGGTCTTA

2301 AAACTCTGTA TGTTAAGATT AGAAGGTTAT ATT AC AATC A ATTTATGTAT 

2351 TTTTTACATT ATCAACTTAT GGTTTCATGG TGGCTGTATC TATGAATGTG 

2401 GCTCCCAGTC AAATTTCAAT GCCCCACCAT TTTAAAAATT A CAAGCATTA 

2451 CTAAAOATAC CAACATGTAT CTAAAGAAAT ACAAATATGG TATCTCAATA 

2501 ACAGCTACTT TTTTATTTTA TAATTTGACA ATGAATACAT TTCTTTTATT 

2551 TACTTCAGTT TTATAAATTG GAACTTTGTT TATCAAATGT ATTGTACTCA 

2601 TAGCTAAATG AAATTATTTC TTACATAAAA ATGTGTAGAA ACTATAAATT

2651 AAAGTGTTTT CACATTTTTG AAACCC 
SEQUENCE ID NO. 6



1 MMQKLQMYVY IYLFMLIAAG PVDLNEGSER EENVEKEGLC NACAWRQNTR
51 YSRIEAIKIQ ILSKLRLETA PNISKDAIRQ LLPRAPPLRE LIDQYDVQRD
101 DSSDGSLEDD DYHATTETII TMPTESDFLM QADGKPKCCF FKFSSKIQYN
151 KVVKAQLWIY LRPVKTPTTV FVQILRLIKP MKDGTRYTGI RSLKLDMSPG
201 TGIWQSIDVK TVLQNWLKQP ESNLGIEIKA LDENGHDLAV TFPGPGEDGL
251 NPFLEVKVTD TPKRSRRDFG LDCDEHSTES RCCRYPLTVD FEAFGWDWII
301 APKRYKANYC SGECEFVFLQ KYPHTHLVHQ ANPRGSAGPC CTPTKMSPIN
351 MLYFNGKEQI IYGKIPAMVV DRCGCS*
SEQUENCE ID NO. 7






. 0 „, W aATTTTCTAATOCAAGTtXIATCCAAAACCC 



hc3753 AAATC 



TTGCTTCTTTAAATTTAQCTCTAAAATACAATACAATAAAGTACTAAAGGCCCAA 



ba3753 



CTATCCATATATTTGAGACCCGTCGAaACTCCTACAACAGTOTTTGTCCAAATCCTGAGA 



bc3753 



CTCXTCAAACCTATGAAA<lACG<TrACAAOCTATACT 
h*375Ii ATCAXCCCAGOOLCTQO^^



h*37S* GATCTTQ CTOTAACCTT C CCA CCA CCA CX? AAGAAGA TGGQCTfiA A TC CCTTTTTTAAOAA 



hr3753 



*^* T ^^XAACAUACACA CCA AAAA OATTC CAUAA<?^GATTTTCCGTX7TTCiA CTOOTGA 



hf! 3 7 5 3 TGAC*L*CTCAACAaAATCACCATCCTCT^ 



ho 37 5? TTTCCATCCOATTOOA - TATCO 



ho783 3 A(K:aATaoTAC/rAaACcacTaToooTocTc; 



ha7B23 ATOAGATTTATATTAAGCCTTCATAACTTCCT^ 
h*7823 TTTTCAAOGTCTC^AATTAAGTACCACA



hfl7823 AAOCATJU/KrTACACTAT<ZrAAACTAAAA<^^ 



h&7Q2'J TTTAACCATCGAAACAAATCATAC - - CA GAAAGTTTTATGATTTCCAKAGTTTTTTKAOO 



h»7 023 CNAO^AAO<MO<^<rrCAAANrTTCA^ 



h3 202 7 ATTTC G GCA C A GGTNAAA CA CTT GAA TTT A TATT CTTA T CK?T A G TATA 



h92027 CTT GGTAA GA TAAAATTC CA C AAAAAT A QO GAT<3 QTG CA GCATAT G CA -A TTT CCATTCC 



h92027 TATTATAATTGACACAOTACATTAACAATCCATGCCAACGCTGCTA 
a9 5 3 2 7 TAAATCTGAACCJTTCCATTATTTTAATACTTOCAAAAACATTACTAAOTATACCAAAATA



n9 5 3 2 7 ATTOACTCTATTATC - TO - AAATOAAO - AATAAACTQATOCXAXCTCAACAATAACXOTT 



u9 5 3 2 7 A CrTTTTATTTT A TAATT^TCLA TAAT GAAT AT ATTTCT OCA TTTA TTTA CT^PGT QTTTTOT A 



nS 5 3 2 7 AATTOGGATTTTGrTAArCAAATTTATTGTACT - ATGACTAAATGAAATTATTTCTTACA 



rx9 5 3 2 7 T - CTAA TTTOTA QAAA CAQTATAA QTTAT AT^AAAGTGTTTTCACA U^l^l ^ ITI^ gAAAgA C 
SEQUENCE ID NO. 8



1 MQKLQLCVYI YLFMLIVAGP VDLNENSEQK ENVEKEGLCN ACTWRQNTKS
51 SRIEAIKIQI LSKLRLETAP NISKDVIRQL LPKAPPLREL IDQYDVQRDD
101 SSDGSLEDDD YHATTETIIT MPTESDFLMQ VDGKPKCCFF KFSSKIQYNK
151 VVKAQLWIYL RPVETPTTVF VQILRLIKPM KDGTRYTGIR SLKLDMNPGT
201 GIWQSIDVKT VLQNWLKQPE SNLGIEIKAL DENGHDLAVT FPGPGEDGLN
251 PFLEVKVTDT PKRSRRDFGL DCDEHSTESR CCRYPLTVDF EAFGWDWIIA
301 PKRYKANYCS GECEFVFLQK YPHTHLVHQA NPRGSAGPCC TPTKMSPINM
351 LYFNGKEQII YGKIPAMVVD RCGCS*
SEQUENCE ID NO. 54



1 GCGGCCGCCC GGGCAGGTAT CGAAAGTTTC ACATATAAAG ATGAATAAGA
51 TCTAAGTGTA TATGTTATTG TTAATAAAGT TTTTAATTTT TCGAATGTCA
101 CATACAGCCT TTATTATTCA TAGATTTATT CCTTTTAAGA AGTAGTCAAA
151 TGAATCAGCT CACCCTTGAC TGTAACAAAA TACTGTTTGG TGACTTGTGA
201 CAGACAGGGT TTTAACCTCT GACAGCGAGA TTCATTGTGG AGCAAGAGCC
251 AATCACAGAT CCCGACGACA CTTGTCTCAT CAAAGTTGGA ATATAAAAAG
301 CCACTTGGAA TACAGTATAA AAGATTCACT GGTGTGGCAA GTTGTCTCTA
351 GACTGGGCAG [illegible] [illegible] GTTACTCAAA AGCAAAAGAA
401 AAGTAAAAGG AAGAAGTAAG [illegible] AAGATTGTAT TGATTTTAAA
451 ACCATGCAAA AACTGCAAAT [illegible] ATTTACCTAT TTATGCTGAT
501 TGTTGCTGGC CCAGTGGATC [illegible] CAGCGAGCAG AAGGAAAATG
551 TGGAAAAAGA GGGGCTGTGT AATGCATGTT TGTGGAGGGA AAACACTACA
601 TCCTCAAGAC TAGAAGCCAT AAAAATCCAA ATCCTCAGTA AACTTCGCCT
651 GGAAACAGCT CCTAACATCA GCAAAGATGC TATCAGACAA CTTTTGCCCA
701 AGGCTCCTCC ACTCCTGGAA CTGATTGATC AGTTCGATGT CCAGAGAGAT
751 GCCAGCAGTG ACGGCTCCTT GGAAGACGAT GACTACCACG CCAGGACGGA
801 AACGGTCATT ACCATGCCCA CGGAGT/GTGA GTAGTCCTGC TGGTGCAAAG
851 CAACGACTCT GCTGACTGCT GTTCTAGTGT TCATGAAAAA CCGATCTATT
901 TTCAGGCTCT TTTAACAAGC TGCTGGCTTG TATGTAAGGA GGAGGGGAAA
951 GAGCTTTTTT CAAGATTTCA TGAGAAATAG ACCAATGAGA CTGAAAGCTG
1001 CTACTTTATT TGTTTCCTTA GAGAGCTAAA AAGCTAAAAA TCAAAAATGA
1051 AATGCTTGCA TAGCATTCAT GTTATATAGT TTAGTATGAC AACTATAACA
1101 TGTTTATGTT TTCACAGCTT AATGCTACCA AGGTAAAGGA TTGGGAAACA
1151 GTATCAGCAA TGTGAAAAAT TTACATCAAA TTTCCTAATT GCATTTGGTT
1201 GCCTGAAATA TGCATTTATA ATAACAGGTT TTTTTTTTTT CATTAATAAA
1251 AGAGAAAGGA AGAAATCTGT AGAGGTTGAA GCCTATCTGG GCATTTGCTG
1301 AACACTTAGA ATGACTTCTG TTATTCAAAA CTATTTCTCA TAGGGTTTTT
1351 ATGGTCTTCA CAGAGTATCT AATTTTGAAA GCTATTAGAG TGGAAAGGAT
1401 AAAAGAATAT TCTTAATAAA CTTAATGTAT TAGTAAGAGC AATAAGGAAG
1451 TAAACACAGC ATAGTGAAAA ATCATGAGCT AATCAGCAGA AAATTCTAAG
1501 AAATAAACAT TTTAATTACA AAGTTCCACT TATACCCTGA CCATGGTACT
1551 ATTGTTGAGA GTACCTTGTC TGCACATATC TAGGAGGCAC ATGCTTAATA
1601 ACCTTCTAAA ATATTATTGT ATTCCTCATA GGAGGGAGAA CTATTACCTA
1651 TATGTAGTAC CTATGTTGTT TCTGAAAGAT AATATGTTTC ATGTATTTCT
1701 GTTGCAGTCA CTTCAAACCT ATACTCAAGG AAAGGGAGAC AGGCATCTCA
1751 ACAGAGAAGG CATGACCAGA AAGAGTTTTG TGCCATGTGT CTGCGATCTT
1801 GCTTTATACA GGGCTCTACC CACTTTAAAC TGGACTCAAA ACAGTTTCAA
1851 AATACTGCTT TTTCTTATTA AGTAACTAGT TTATAAGGCA ACAAATAAAT
1901 TTCCTTTAAG ACTGTGCTAT CAGATAATCC TGGAATAGAT TTGCCTTACT
1951 TATAAACAAT CTTGAGAAAA CAAAAAGGCA AGAAATTGCT AAGTGCTTCT
2001 GCTTACAATG ACAGCCTGGC CCTAAAGACA ATGTTTTCTA AGTTTTGAAA
2051 CAGCTTGAAT ACAACATCTA AGTTTTGGTG CTAATTACCT GCTAGTTTTT
2101 TTATTTTTTT CCTTTAAAAG GCTGTCCCAG CGTCCTAACA TAACAGATGC
2151 ACTATATTTT CTGCTAATTC CCGAGGCTCA GTTAGTTGCT CACTGTGTCT
2201 TGTCCCCAGG TAATTCAGGC CTGGGGGAAG GGTTCCTTCC TCCAGACTGA
2251 TTGGTACAGC TGCTCAGTAA GTGTAACTAC TCAGATTCCC AAAGAATTCT
2301 AAGTGGATGT TCTTCCACAG TGTCTCTTGT TCTCTCTAAT CATCATCATT
2351 TTAAAATTTC ATCCACTTTT CATTCCTTAA TAGAATTTTC CTTAGTCCAC
2401 AGTTCTCTGG AAAGGAAGTA GGCTTCTCAT AhCAGCTGAA AAAACATATA
2451 CCTAAAAGAT TCTGAAAAGC TGTAATAACT GTTATACTTG ATATTTTGCT
2501 GTTATGAATG AAATGCTACA TATTTTTCCA TTTTAAAAGA CTAAATATGC

2551 AC AC AT T AT T C C AAT T AAAA AAT G T T CAT A GATTGATATGGAGGTGTTCG 

2 601 TTCATTTTTC ATAAAAATGA TCTTAGTAAC TTTTTTTCTT ATTCATTTAT 

2 651 AG/CTGATCTT C T AAC G CAAG T G G AAG G AAA ACCCAAATGT TGCTTCTTTA 

2701 AATTTAGCTC TAAGATACAA TACAATAAAC TAGTAAAGGC CCAACTGTGG 

27 51 ATATATCTGA GGCCTGTCAA GACTCCTGCG ACAGTGTTTG TGCAAATCCT 

28 01 GAGACTCATC AAACCCATGA AAGACGGTAC AAGGTATACT GGAATCCGAT 
2 8 51 CTCTGAAACT T G AC AT G AAC CCAGGCACTG GTATTTGGCA GAGCATTGAT 

2 901 GTGAAGACAG TGTTGCAGAA CTGGCTCAAA CAACCTGAAT CCAACTTAGG 
2951 CATTGAAATC AAAGCTTTAG ATGAGAATGG CCATGATCTT GCTGTAACCT 

3 001 TCCCAGAACC AG GAG AAG AT GGACTG/GTAA G T GAT T AC T G AAAAT AAC AT 
3 051 GCTAAAAACC TTGTTATGTG TTTATTCATA ATGTGAATGA AT AG TAG T G A 
3101 AAAAT AAC TA CCAGTTTCCT GTGCTTATAA GCCAGACAAA GGCACCTTAC 
3151 CCCAGTGGTA GCCCTGTACT CAATAAAAGT AGGTGTCCCA TTTCACATCC 
32 01 TAT GAAAC AC TCTCTTGATA CTTTGACTTT GCATGAGGAT TTAAAAGAAA 

32 51 AAAAG T TATA CCATGGTCCT TAAGTTTTTA GGGAATTCTT TGGAATTGAG 
3301 AATGAAATAT AAAATGCTTT CCGTTGATGT GCTACATGAT TATATAAATA 

33 51 AAAAC AT G AA GTCTTCACAG TGGATTCTAG TACTCACCCA ACAACACATT 

34 01 TTTTCCCCCA G AAG AG T G AC CAATTTGTTA AAATTCTTTT GCTTAATAAG 
3 4 51 GCAGAAAAAT GAACTCTACA AGTTATAATT AAAATAAAAT GCTTTTACTT 
3501 AT AGAAAT T A ACTAGATATA TGTTCAGGTT TATATACTAT T AAA TAT AC T 
3551 ATATTTAAGA TCTCTCATGA TAAATATGTT CCTTGTTTTA TAGACTATTG 
3 601 ATGCACTGAT GTATATGTGG ATTACTTTGT GAATTACCCC TGGTAAAATT 
3 651 AAAAATTTCA GGCTAGTTAA CTTGTACTAC TTAGCTATTT TCTGAACTGT 
37 01 CTTACTGTTC TTTAACAGGA GTTAACTTAG GTAATGTCAA CTAATTTAAT 
3751 ATAAAGTCAA ACAGAAAATA ATGCCTTATA TATTATAAAA ATTAATAAAA

38 01 AACCATTTTA AAATCTAGTA T AAG T T TAG A GCTACTCACT CTTCTGGCTT 

3 8 51 ATCTATGCTT GTATTTACTT CTGTTTTCAA AAAATTTTTT AATGTGACCA 
3901 TACCTTTTAT TTCCAGTTAT TGATATAATT TACAACAAAA GAT TAT AC T T 
3951 GCAAGCTTTA TAGTTTTTAA ATGGTCTTAT TTGTAGTGAA TATCATATCT 

4 001 AAATGATATC T AAAT G T AAA G T AAAT CAT A CCTAAATGAA AACATATTCT 
4 051 TTAAGTCATT ATAAAATTTT CCAGGTGATC AATTTTTCTT TAAAT AT AC T 
4101 ACATAAAATG TTATTGACTC C C AAAAT G AT GTTATTTTGT ATAATCTTAA 
4151 ATACCAATAA TTACCAGGTC TATTTTGGTT TTAGTGTAGG ATAAAAAAGA 
4201 ATGTGTTCTT TTTTCTAGGT AG CAT T T T AA TGATCAAAGT TGGTGACGTG 
4 2 51 ACAGAGGTCT T AAG TAT TAT T AAAC AG AT G ATTAATAAGA TGTATTCCTC 
4 3 01 AGACTTTTCC AT AT AAAAG G AAAAATGTCT CAAATTCATG AAAAGATTGG 
4 351 TACAGGAGGA GGATTAGCAA ATTGTAGTTT AAAT AT C T G A AT G G AAAC AC 
4 4 01 TTTTTAGTGA AAGAATAAAG GGAATATCAT TGTATCTTCT TCTGAGTCTG 
4 4 51 TGCCTCTCTC TCTTGGAGTT AGTCTTTCCA ACCCTATATA CTTACCACTA 
4 501 TCTTCATCCC TCTACCTTCC TTTTTCCCAT TACATCTGTG CAGTACTGGG 
4 551 TGGCAACTAT TGTGTTTCGG TGTTAATATC CAAGTTTCCC TGAATAAGAC 
4 601 CAAGTGAATG GAGGATGAAT GAGTATACCT ATCCCTCCAG GGGTCATCAG 
4 651 ACATATTTAG CCACCATATT TAATCAATAA GCAGGAAGAC ATAAGCTAGC 
4701 CTTGTCCTTC TTCTTTCCTC CCTGCTCCTT TCTCTTCTCT TCCCCCTCTC 
4 7 51 CCTTTACTGT CATCCATCAG TATTTTCAGA GCATCTATTA TGTGTCAGGC 
4 801 AT TC AG AT AC T C AAAC G GAG GAAAACAAGA. ATAAACAAGA CAAAGATCTG 
4 851 ACCACAGGGG AATCCCTATG GCTACTGTAG ACTTTTGAGC CATAAAGGAA 
4 901 GAATCAAGCC TAG T G TAAAT GAAAATTCCT TAATGCTGTG CCTT T TAAAA 
4 951 AGAAATGTGA CATAAGCAAA ATGATTAGTTTCTTT CTTTA AT AA T GAG T C 
5001 CTTGAGGTAG GAGAGTGTTT TGGGATCTATTATTAACTCT TCTTTCCTTT 
5051 CCATACAG/AC TCCTTTTTTA GAAGTCAAGG TAACAGACAC ACCAAAAAGA

5101 T C TAGGAG AG ATTTTGGGCT TGATTGTGAT GAACAC TCCA CAGAATCTGG 

5151 ATGCTGTCGT TACCCTCTAA CTGTGGATTT TGAAGCTTTT GGATGGGATT 

5201 GGATTATTGC ACCTAAAAGA TATAAGGCCA ATTACTGCTC TGGAGAATGT 

5251 GAATTTGTAT TTTTGCAAAA GTATCCTCAT ACCCATCTTG TGCACCAAGC 

5301 AAACCCCAGA GGTTCAGCCG GCCCCTGCTG TACTCCTACA AAGATGTCTC 

5351 CAATTAATAT GCTATATTTT AATGGCGAAG GACAAATAAT ATACGGGAAG 

54 01 ATTCCAGCCA T G G TAG TAG A TCGCTGTGGGTGTTCATGAG GTCTATATTT 

54 51 GGTTCATAGC TTCCTCAAAC ATGGAAGGTC TTCCCCTCAA CAATTTTGAA 

5501 ACTGTGAAAT TATGTACCAC AGGCTATAAG C C TAG AG TAT G C T AC AG T C A 

5551 CTTAAGCACA AG C T AC AG T A TATGAGCTAA AAAGAGAGAA TATATGCAAT 

5601 GGTTGGCATT TAACCATCCA AACAAATCGT ATAATAAAAA GTTTTATGAT 

5651 TTCCAGAGTT TTTGAACTAG GAG AT C AAAT TCCATTTATG TTGAAATATA 

5701 TTACAACACA TGCAGGTGAA TGAAAGCAAT TCTCCTTGTC TTCTGGTGAA 

57 51 TTAAAGGAGT ATGCTTTAAA ATCTATTTCT CTACAGTTTC 